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Now accepting applications for the Michelson 
Philanthropies & Science Prize for Immunology. 


The Michelson Philanthropies and Science Prize for 
Immunology focuses on transformative research in 
human immunology, with trans-disease applications 
to accelerate vaccine and immunotherapeutic 
discovery. This international prize supports investigators 
35 and younger, who apply their expertise to research 
that has a lasting impact on vaccine development and 
immunotherapy. It is open to researchers from a wide 
range of disciplines including computer science, 
artificial intelligence/machine learning, protein 
engineering, nanotechnology, genomics, parasitology 
and tropical medicine, neurodegenerative diseases, 
and gene editing. 


Application deadline: Oct. 1, 2022. 
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“The Michelson Philanthropies & Science 
Prize for Immunology will greatly impact 
my future work. As | am just starting my 
scientific career, it will illuminate my work, 
spark interest and support me to continue 
my research in this field.” 


Paul Bastard, MD, PhD, 
Laboratory of Human Genetics 
of Infectious Diseases, Imagine 
Institute (INSERM, University of 
Paris), Paris, France; and The 
Rockefeller University, New York. 


Dr. Bastard received the inaugural 
Grand Prize for his essay: “Why 
do people die from COVID-19: 
Autoantibodies neutralizing type 

| interferons increase with age.” 


GRAND PRIZE: 
$30,000 


FINALIST PRIZE: 
$10,000 
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Publish your research in the Science family of journals 


The Science family of journals (Science, Science Advances, Science Immunology, Science 
Robotics, Science Signaling, and Science Translational Medicine) are among the most highly- 
regarded journals in the world for quality and selectivity. Our peer-reviewed journals are 
committed to publishing cutting-edge research, incisive scientific commentary, and insights 
on what's important to the scientific world at the highest standards. 


Submit your research today! 
Learn more at Science.org/journals 


Advertorial 


Since opening its doors in 1986, the Pohang University of Science and Technology 
in South Korea, commonly called POSTECH, has become a hub for groundbreaking 
research, In 2019, POSTECH ranked highest in Asia on Reuters’ list of the World’s 
Most Innovative Universities,’ outflanking historic institutions such as the United 
Kingdom's Cambridge University, Caltech in the United States, and Tsinghua 
University in China. 


The university's commitment to the growth of high-quality research is 
demonstrated by its investment in basic science research, which necessarily 
underpins and advances the organization’s well-established applied science work. 

For example, Moon Jeong Park leads a team within the Department of 
Chemistry that focuses on elucidating the interplay of transport and morphology 
in nanostructured charged polymer materials; the team’s work is based on a 
fundamental understanding of molecular interactions. 

Her team is currently working on developing new ion-containing polymeric 
materials with tailored molecular interactions that are more predictable, 
efficient, and sustainable for energy storage and transport. Advances in this 
area could include the development of artificial muscles, electric cars, and 
wearable devices. 


Passion drives basic research at POSTECH 

Thirteen years ago, Park moved from the University of California, Berkeley, 
back to South Korea. She recalls some challenges convincing colleagues in the 
United States who thought her work on such materials would not be practical 


'https://www.reuters.com/innovative-universities-2019. 


Dr. Moon Jeong Park (second from right) discusses her research with students in her lab. 


The need for basic scientific research in an applied age 


because of their low ionic conductivity and limited performance at high current 


densities. However, POSTECH supported her research, and over the past decade, 


her work has proved to be highly relevant to the field of battery technology. 
Scientific research into batteries has seen a surge in activity and investment 
due in large part to the growing demand for electric vehicles. 

One major drawback of the most commonly used vehicle batteries is that 
they contain ions that are moved by a potentially flammable liquid electrolyte. 
In order to create the next generation of energy storage devices, Park’s team 
has worked on developing highly efficient solid-state polymer electrolytes to 
replace the more unstable compounds. 

“Now, it is becoming a common goal to try to obtain high conductivity and 
high transference numbers from solid-state polymer electrolytes. It is my belief 
that eventually this kind of fundamental research will drive and direct research 
in this field,” says Park. 

Her team recently developed a polymer electrolyte with a novel 
nanostructure that demonstrated a significant enhancement in ionic 
conductivity compared to previous versions. “This provides a potential route to 
accelerate the commercialization of safer, all-solid-state batteries,” she says. 


Tools that enable basic scientific research 

to flourish 

Park appreciates the challenges posed to early-career researchers who 
conduct fundamental research. As opposed to applied research, basic research 
can appear frustratingly time-consuming. The key to motivating her team is to 
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ensure that “fundamentals learned in school, and the ability to synthesize and 
analyze your own materials, will form a solid foundation for growing as a leader 
as you transition to becoming an independent researcher,” she says. 

She speaks from personal experience. The reason Park was able to gain 
international recognition 
for her work at a sucha 


young age was due to the 
level of support offered by 
the university, she says. 
“POSTECH encourages 
participation in international 
conferences and journal 
activities. Also, there are 

no major difficulties in 
purchasing equipment and 
setting up a lab.” 

This supportive research 
environment leads to a highly 
cooperative and collaborative 
working style. “Even if | call 
for advice or support on 
a Saturday evening, there 
are staff members who are 
always there to help,” she says. 


Small, research-focused community 

For scientists whose research focus is long-term and necessarily involves 
steady and incremental steps, POSTECH is an ideal environment, says Young- 
Tae Chang. 

In Chang's lab, which is based in the Department of Chemistry, his team 
develops bioimaging probes for live cells. To understand complex biosystems, 
it is imperative to be able to discriminate between live and dead cells. Chang 
and his colleagues employ a diversity-oriented fluorescence library approach 
(DOFLA) to screen novel fluorescent molecules and have so far developed 30 
cell-selective probes. 

Chang started his research career at POSTECH, as one of the university's 
first batch of students. After finishing his Ph.D., he moved to the United 
States and then to Singapore, where he spent 20 years exploring the different 
scientific environments in those two countries before deciding to come back 
to his alma mater. “| missed the small and research-focused environment of 
POSTECH,” he says. 

His passion for his work comes from the promise of scientific discovery and 
its limitless possibilities for improving human health. “We have genes that 
generate millions of antibodies to fight any antigen. If we can correctly identify 
these antigens, it seems like a magic solution for any problem,” he says. 

After two decades working in this field, Chang hopes to use his research to 
tackle issues around the aging of the human body. He hopes that his probes, 
which can correctly discriminate between old and young cells, will help identify 
novel biomarkers for the aging process. 


Dr. Young-Tae Chang's (left) research team develops bioimaging probes for use in live cells. 
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“While the study of how our bodies age is attracting a lot of attention these 
days, the systematic discovery of biomarkers is far behind other fields, such 
as cancer research,” he says. Chang hopes that once scientists identify the 
molecules involved in aging, it will open the way to discovering a possible 
molecular solution to slow or 
even reverse the process. 

As a chemical biologist, 
Chang believes that adopting 
a molecular approach to 
human health interventions 
is safer than taking a 
gene-editing approach. 

“The sensor and molecular- 
imaging probes will provide 
unprecedented new tools for 
clinical checkups," he says. 

Aging-specific probes 
could make it possible to 
monitor specific molecules 
in our bodies and offer a 
guide to the early detection 
of problems, he adds. “The 
aging rate is not the same for 
different organs. And the failure of a single organ will determine our lifespan, 
not the collapse of our entire body at once. Therefore, probes that detect age- 
related biomarkers may be able to guide preventative interventions as well as 
act as agents to reverse aging,” says Chang. 


The future for basic science at POSTECH 

Scientists at POSTECH are extremely optimistic about the future of fundamental 
research at their institution. The campus has a high concentration of top-level, 
motivated students and offers the right conditions to produce outstanding 
research, says Chang. Park adds that the superior research environment, which 
includes a synchrotron facility, attracts scientists from all over the country. 
This enables plentiful opportunities for collaboration and builds the university's 
reputation as a hub for carrying out cutting-edge science. 

“POSTECH was founded in the pursuit of being a world-class, research- 
oriented university, and | think all faculty members work hard to exemplify our 
founding spirit,” says Park. But it is important for people who do basic science 
not to get too caught up in their own research worlds, she adds. Park believes 
that having an open culture of collaboration creates positive feedback loops 
that benefit everyone's work. 
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EDITORIAL 


Rethinking the retraction process 


igh-profile examples of scientific fraud con- 
tinue to plague research. Recently, Science 
published two news stories on alleged image 
manipulation in Alzheimer’s research and un- 
reliable data in an ecology study, sadly showing 
that the problem persists. Each case involved 
back and forth among the journal, authors, and 
institutions to correct the scientific record. Journalists 
and advocates for research integrity (including coura- 
geous whistleblowers) are understandably frustrated 
about how long it takes to retract papers or at least 
to post editorial expressions of concern. It’s time to 
devise a more efficient solution. 

Most journals follow guidelines for research mis- 
conduct promoted by the Committee on Publication 
Ethics (COPE). Accordingly, should Science receive a 
report of alleged image manipu- 
lation or other problems, our 
first step is to contact the au- 
thors. Most of the time, authors 
will deny the problem or insist 
it can be solved with a minor 
correction. Much of the time, 
we know that neither answer 
is satisfactory but before mov- 
ing to retract, our next step is 
to contact the authors’ institu- 
tions. This is because we’re not 
an investigatory body. Again, 
the most likely answer we will 
get is inconclusive, if there is an 
answer at all. According to COPE, we’re then supposed 
to contact the institution every 3 to 6 to months for 
a response. And this exasperating wait can last for a 
year or more. 

So, what’s the alternative? We could be more aggres- 
sive in retracting things without consulting as widely, 
but that could have a very big impact on scientists’ 
careers, and a mistake would be costly. We could try to 
get the universities to respond more quickly and more 
transparently. As a former provost myself, I received 
many such inquiries over the years. Now that I’ve 
experienced both sides of this interaction, I’ve been 
thinking about how the process could be improved. 

In 2000, the US Department of Health and Human 
Services issued a lengthy and legalistic set of regula- 
tions for what universities are supposed to do about 
research misconduct. A quick read of this labyrinth 
makes it easy to see how university administrators 
become paralyzed by the fear of running afoul of the 


"advocates for 
research integrity... 


are...frustrated about 
how long it takes 
to retract papers...” 


agency’s regulations. Add to this the possibility that 
an unhappy author could threaten litigation against 
the institution, drawing legal counsel and public rela- 
tions into the equation, and the only possible outcome 
is gridlock. With a university’s reputation at risk and 
legal action looming, the last thing on their mind is 
the paper that is under dispute. 

The journals have a different goal. Science just 
wants to know if the paper should be retracted or not. 
Although we're curious about the details—and some- 
times our news division, which is editorially inde- 
pendent of the research department, might decide to 
look deeper—the matter of whether the problem was a 
rogue postdoc or graduate student, sloppy science, or 
outright fraud by the principal investigator is not for 
us to adjudicate. At the same time, the university is 
focused on the personnel ques- 
tions, making them prone to 
err on the side of revealing less 
than the full story. 

The solution would be for 
the universities, federal govern- 
ment, funding agencies, and 
journals to come together and 
agree that these investigations 
should be a two-stage process. 
The first stage should evaluate 
the validity of the paper with- 
out attributing blame. The uni- 
versity would then feel free to 
determine the validity of the 
paper before it plunges into a lengthy and more com- 
plicated investigation of the underlying wrongdoing. 
If the paper is not valid, it can then be retracted much 
more quickly. The second stage, with journals out of 
the picture, would be for the university to determine 
whether there was fraud that rises to the level of re- 
search misconduct. This plan would accelerate correc- 
tion of the scientific record. 

I asked prominent research integrity advocate Elisa- 
beth Bik what she thought of this. “It would be ben- 
eficial for misconduct cases to split the investigations 
into two,” she said. “On one hand, there is the question 
whether the data in a paper is reliable or not, while 
on the other hand, one can question who has been re- 
sponsible for generating that data. These are two sepa- 
rate questions.” 

Indeed, they are separate, and the process to ad- 
dress retractions needs to change. 

-H. Holden Thorp 


H. Holden Thorp 
Editor-in-Chief, 
Science journals. 
hthorp@aaas.org: 
@hholdenthorp 
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BUILDING ON A 
MILLENNIUM OF 


ARAB INNOVATION 


The Middle East was once at the forefront of invention. Algebra, 
universities, inoculation against disease, coffee, windmills, the 
arch and even the three-course meal: all had their genesis in 
the region. Yet over the past 150 years, the focus on innovation 
on a global scale has not been as sharp as it once was. 


That’s why, as we enter the second quarter of the Digital 
Century, the Kingdom of Saudi Arabia will reinvigorate that 
inventive tradition with a range of new research, development 
and innovation priorities aimed at addressing the most pressing 
issues affecting mankind today. 


Through a focus on Health and Wellness, Sustainable Environment 
and Supply of Essential Needs, Energy and Industrial Leadership, 
and Economies of the Future, the newly established Research, 
Development and Innovation Authority (RDIA) will be a steward 
for the Kingdom’s work with global partners, deploying a multi- 
billion-dollar budget over the coming years. 


We are determined to help make world-class healthcare 
accessible and equitable across society. As an existing leader 
in digital health, the Kingdom will continue investing to advance 
the technology even further. As we demonstrated during the 
Covid-19 pandemic, this is an area in which we lead the world. 


Saudi Arabia is intent on addressing not only domestic but 
also global health challenges through focusing on ultra-early 
disease detection and biotechnology to prevent the blight of 
illnesses such as cancer and chronic heart disease from ending 
lives prematurely. 


A good life isn’t limited to good health. Security of food, water 
and energy are vital for happiness and wellbeing. 


The world’s largest solar-powered desalination plant is located in 
Al Khafji in Saudi Arabia and we continue to invest in innovation 
to make desalination environmentally friendly, more accessible to 
many more households and businesses, cheaper to achieve and, 
crucially for food security, able to be used easily in farming. 


Saudi Arabia has prospered over the past century thanks, in 
large part, to our oil revenue. We’re now investing that revenue 
in finding cleaner, greener sources of energy to power our 
way into a brighter future. The Kingdom is building what will be 
the world’s largest green hydrogen facility, with a production 
target of 50 tonnes a day, aiming to produce 5 million tonnes 
of clean hydrogen by 2030. 


Though the innovation and invention timescale can be 
frustratingly long, the Kingdom is committed to becoming an 
innovation superpower in the next two decades. We expect 
that by 2040, our total expenditure on innovation will reach 
2.5% of GDP, more than the G20 average. 


We can’t do this alone. 


At the heart of our commitment to innovation is the 
desire to build globally collaborative schemes and 
institutions which, together, will help mankind harness 
technology to improve lives. Saudi Arabia will invest in 
bringing scientific talent from other nations to work with 
our own brightest minds. 


KACST (the King Abdulaziz City for Science and Technology) 
- the Kingdom’s leading research and innovation centre — is 
already working with Cambridge University, Harvard, Berkley 
and MIT to pool knowledge, resources, information, and 
ability. As our programme expands in capability, so will our 
world advance in possibility. 


Innovation, investment and people: the Research, 
Development and Innovation Authority brings all these 
together. Building on 1,500 years of our region's exceptionally 
inventive history, the Kingdom of Saudi Arabia will use 
the fruits of our prosperity to build a better future for 
everyone, everywhere. 


His Excellency Munir Eldesouki - President of King Abdul Aziz 
City for Science and Technology (KACST) 
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464 Wellcome remains an institutionally racist organisation. 99 

Jeremy Farrar, director of the Wellcome Trust, in Times Higher Education, announcing a new push to support 

applicants from diverse backgrounds in its grantmaking, but with no details on funding. A report found the large 
biomedical research charity's antiracism program, launched 2 years ago, has made limited progress. 
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COVID-19 


U.K. OKs anti-Omicron vaccine 


he United Kingdom this week became the first country to approve 
an updated COVID-19 booster directed at two different strains 
of the coronavirus. The “bivalent” booster, made by Moderna, 
will have the same dose of messenger RNA as the company’s 


least two dozen sewage sampling sites in 
London and across the country to check the 
full extent of the spread. 


CDC puts health equity on the map 


PUBLIC HEALTH | The U.S. Centers for 
Disease Control and Prevention (CDC) 

and other health agencies have launched 
the first national index of how pollution, 
illness, and poverty affect health equity. The 
Environmental Justice Index, which debuted 
last week, combines several existing indices 


of environmental justice—based on data 
from CDC, the Environmental Protection 
Agency, and other sources—into an overall 
measure of environmental harm to health 
and well-being. The maps display data down 
to the scale of neighborhoods. That’s a use- 
ful innovation, says Philip Alberti, senior 
director of health equity research and policy 
at the Association of American Medical 
Colleges. “The impacts of environmental 
injustice tend to be hyperlocal.” Another 


prior boosters but will target both the original version of the 

coronavirus and the first Omicron variant, BA.1, which emerged 
in late 2021. Clinical trials and other research by Moderna suggest its 
retooled booster generates a stronger immune response against BA.1 
and other versions of Omicron now in circulation than its existing vac- 
cine, introduced in December 2020. The new shot will be available in 
the fall. A similar booster is expected in the United States, although in 
June the Food and Drug Administration asked Moderna and a second 
manufacturer, Pfizer, to include the spike protein component for more 


recent versions of Omicron, BA.4 and BA.5, rather than BA.1. 


Ahealth school’s colonial past 


HISTORY OF SCIENCE | Areport on the 
history of the London School of Hygiene & 
Tropical Medicine (LSHTM) has found the 
prestigious institution “embraced British 
colonialism and the notions of racism and 
white supremacy.” The review, commissioned 
by LSHTM and released last week, describes 
the institute’s history from its founding in 
1899 to 1960, when it was funded by the 
government ministry that administered 
colonies and overseas territories and by 
companies with commercial interests there. 
The school’s research sought to protect the 
health of Europeans in the colonies and the 
companies’ profits rather than the health 

of the other people who lived there, the 
report concludes. It details an experiment 
by physician Patrick Manson, before he 
founded the school, in which he exposed his 
servant to nematode-carrying mosquitoes to 
study the parasite’s life cycle. The investiga- 
tion was recommended by a 2020 report 

on racism and equality commissioned by 
LSHTM, which has prompted the school to 
adopt new measures to improve its diversity, 
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equity, and inclusion. LSHTM will offer a 
new funding scheme for Ph.D. scholars at 


partner institutions in five African countries. 


Polio boosters for London kids 


INFECTIOUS DISEASES | Citing new data 
indicating poliovirus continues to circulate 
in at least eight of London’s 32 boroughs, 
U.K. authorities recommended last week 
that all the city’s children between the ages 
of 1 and 9 receive an additional booster 
dose of polio vaccine. The virus, detected 
by routine wastewater samples starting in 
February, is a type derived from the oral 
polio vaccine, which contains live, weakened 
virus. Virus in the live vaccine, which is no 
longer used in richer countries, can spread 
and mutate into a version that can cause 
paralysis in unvaccinated people. Travelers 
probably introduced the virus into the 
United Kingdom; the country has had no 
cases of polio so far, but genetic analysis of 
the samples suggests the virus is spreading 
beyond a small group of people, accord- 
ing to a 10 August report by the UK Health 
Security Agency. Authorities are adding at 


novelty is a focus on the combined effects 
of multiple health inequities. CDC hopes 
the website will help public health officials 
respond to areas with the highest risk of ill- 
ness, set goals, and track progress. 


3D data may aid blind scientists 


DIVERSITY | Visually impaired scientists 
often rely on text readers to read journal 
articles, but until now there has been no 
easy way to render graphics and data in 

a way they could perceive. A group of 
researchers has developed a solution: 3D 
print a lithophane, a thin plastic sheet that 


3D-printed lithophanes can help optically impaired 
scientists “see” data, such as from protein separation 
gels, with their fingertips. 
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IN FOCUS Scientists rallied outside Canada’s Parliament on 11 August, carrying a 70-meter-long letter 

with more than 7000 signatures. The letter to lawmakers calls for increases in the stipends paid by graduate 
student scholarships and postdoctoral fellowships awarded by the Natural Sciences and Engineering Research 
Council of Canada. “We can’t do science if we can’t pay rent,” one rallygoer’'s sign read. 


displays visual data in the form of bumps 
and other surface features. Users can run 
their finger across the surface and perceive 
features of graphs such as data points, 

axes, and curves. For a stained protein or 
nucleic acid gel, a lithophane can represent 
the intensities of the stain as variations in 
relief. 3D printers have become widely avail- 
able, and the printing only takes minutes. 
Bryan Shaw, a chemist at Baylor University, 
and his team collaborated with four opti- 
cally impaired scientists to develop and 
describe the method on 17 August in Science 
Advances. (See a Science interview with 
them at https://scim.ag/LithophaneQA.) 


NAS can suspend, not just expel 


MISCONDUCT | The US. National Academy 
of Sciences (NAS) has changed its bylaws to 
permit members to be temporarily sus- 
pended, rather than permanently ejected, 
from the prestigious body if they breach 

its code of conduct, Science has learned. 
Separately, the academy made public that 
last week it has for 5 years barred a member, 
Jane Lubchenco, a White House official and 
former head of the National Oceanic and 
Atmospheric Administration, from receiving 
NAS honors and working on publications or 
programs for the academy or the National 
Research Council. The bylaw change was 
approved on 24 June in an online vote 

open to all 2500 NAS members, with more 
than 75% of participating members vot- 

ing in favor. Requested by the standing 
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committee that reviews alleged violations of 
the academy’s code of conduct, the change 
“will allow the NAS to take meaningful 
actions on conduct issues with less severe 
consequences,” its president, Marcia McNutt, 
said in a statement to Science. A two-thirds 
vote of NAS’s 17-member governing council 
will be required to restore a suspended 
membership. The new policy modifies 

one approved in 2019 that allows the body 
to expel members who breach its code, 
which prohibits scientific and financial 
misconduct, discrimination, harassment, 
and bullying. NAS has since permanently 
ejected at least three members: evolution- 
ary biologist Francisco Ayala, astronomer 
Geoffrey Marcy, and Peruvian archaeologist 
Luis Jaime Castillo Butters. The sanctions 
on Lubchenco, who is now deputy director 
for climate and environment in the Office 

of Science and Technology Policy, stem from 
her role in editing a paper in the Proceedings 
of the National Academy of Sciences on 
which her brother-in-law was an author. She 
has previously apologized for the conflict of 
interest and now accepts the sanctions. 


Museum to bury disputed skulls 


ANTHROPOLOGY | The University of 
Pennsylvania’s archaeology museum is 
planning a proper burial for 13 skulls of 
Black residents of Philadelphia it has held 
since 1966. The skulls are part of a contro- 
versial collection of more than 1300 skulls 
at the museum. They were gathered in the 


mid-19th century by anthropologist Samuel 
Morton and others, and used by Morton to 
justify racist ideas about human intelligence. 
The skulls to be reburied were dug up from 
the Blockley Almshouse, a charity hospital 

at the site where the museum now stands, 
and probably came from enslaved people, 

a 2021 report said. The museum and some 
observers see the planned reburial, in the 
historically Black Eden Cemetery, as a step 
toward reconciliation. But some members of 
the city’s Black community say the museum's 
decision was rushed and descendants should 
have the final say in where the remains are 
laid to rest. The disposition of the remaining 
skulls in the collection has yet to be resolved. 


Accident kills 21,000 research fish 


LAB SAFETY | Chlorine poisoning likely 
caused the deaths last week of more than 
21,000 fish at a research center on aquatic 
biology and aquaculture at the University of 
California, Davis. About three-quarters of the 
fish lost were part of ecological physiologist 
Nann Fangue’s research into the conserva- 
tion and physiology of native California 
species. The animals lost included green and 
white sturgeon, one at least 20 years old, 
and endangered Chinook salmon. Chlorine, 
which is used for disinfection, is toxic to fish 
even in small amounts. The university apolo- 
gized for the unprecedented “catastrophic 
failure” and is investigating the incident and 
reviewing plumbing and procedures at the 
center and elsewhere on campus. 
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Small stowaways on new NASA 
rocket promise big science 


Batteries allowing, CubeSats will target lunar ice and more 


By Eric Hand 


hen NASA’s most powerful rocket 

ever attempts its first flight this 

month, its highest profile payload 

will be three instrumented manne- 

quins, setting off on a 42-day jour- 

ney beyond the Moon and back. 

They are stand-ins for the astronauts that 

the 98-meter-tall rocket, known as the Space 

Launch System (SLS), is supposed to carry to 

the Moon as soon as 2025, as part of NASA’s 

Artemis program. But when the SLS lifts off 

on 29 August, other voyagers will be along 

for the ride: 10 CubeSats, satellites no bigger 

than a small briefcase, to probe the Moon, as- 
teroids, and effects of deep-space radiation. 

The teams that built those satellites have 

more than the usual launch jitters: Half of the 

devices may not have enough power to begin 

their missions. Stuck within the rocket for 

more than a year because of launch delays, 
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their batteries have drained and some may 
be unable to boot up and unfurl solar panels. 
“The longer we wait, the higher the risk,” says 
Ben Malphrus of Morehead State University, 
principal investigator for Lunar IceCube, one 
of the probes with power concerns. 

At stake is not just data, but a test of 
CubeSats as deep-space probes. “We're in the 
transition phase from being a curiosity and 
a training tool to being a platform for real 
science,’ Malphrus says. CubeSats are easy 
to assemble out of standardized parts, from 
thrifty ion propulsion systems to pint-size ra- 
dio transmitters. That lets researchers focus 
on developing instruments capable of gather- 
ing novel data—if they can be shrunk into a 
CubeSat package. 

At millions of dollars a pop compared with 
hundreds of millions for a bigger, stand-alone 
satellite on its own rocket, the spacecraft can 
take on riskier missions, including hitchhik- 
ing on the unproven SLS. “When it comes to 


NASA's Space Launch System, seen here in a June 
dress rehearsal, will be its most powerful rocket ever. 


CubeSats, failure is an option,’ Bhavya Lal, 
NASAs associate administrator for technol- 
ogy, policy, and strategy, said at a briefing 
earlier this month. 

Several SLS CubeSats will focus on lunar 
ice, which has intrigued researchers ever 
since NASA’s Lunar Prospector discovered a 
signal suggestive of water in the late 1990s. 
Using a neutron detector, it peered into frigid, 
permanently shadowed regions in polar cra- 
ters. In many, the probe detected a curious 
suppression of neutrons—best explained by 
extra hydrogen in the topmost meter of soil. 

Researchers assume much of the hydrogen 
represents water ice trapped in the craters’ 
coldest, darkest recesses. Ancient impacts of 
comets or asteroids could have delivered the 
ice, but the hydrogen could also be implanted 
by the solar wind. When hydrogen ions in the 
wind strike oxygen-bearing minerals in lunar 
soil, the barrage creates hydroxyl, which can 
be transformed into water through subse- 
quent reactions. If the Moon holds enough 
water, it could be used for agriculture and 
life support, and split into hydrogen and oxy- 
gen for rocket propellant. “That will be more 
economical than bringing it from Earth,’ says 
Hannah Sargeant, a planetary scientist at the 
University of Central Florida. 

The Lunar Polar Hydrogen Mapper (LunaH 
Map), led by Craig Hardgrove of Arizona 
State University, Tempe, will attempt to im- 
prove on Lunar Prospector’s maps with a dar- 
ing orbit that swoops just 12 to 15 kilometers 
above the Moon’s south pole. Over the course 
of 280 passes, the team hopes the probe’s neu- 
tron detector will map excess hydrogen with 
a resolution of 20 to 30 kilometers, about 
twice as good as Lunar Prospector. “We can 
distinguish one [deep crater] from another,’ 
Hardgrove says. Craters lacking hydrogen, 
or enrichments outside the frigid hideouts, 
might indicate a relatively recent impact that 
blasted out ice and redistributed it, he says. 

Another CubeSat, Lunar IceCube, will 
carry a spectrometer that can detect the 
infrared fingerprints of either water or hy- 


below the crew capsule on NASA's new rocket. 
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droxyl. Because the device depends on re- 
flected light, it will be most sensitive to signs 
of hydroxyl and water in sunlit regions at 
lower latitudes. “They’re really looking at the 
[effect of] the solar wind, day by day,’ says 
Benjamin Greenhagen, a planetary scientist 
at the Johns Hopkins University Applied 
Physics Laboratory. 

Some of the CubeSats are headed beyond 
the Moon. After the SLS leaves Earth’s or- 
bit and releases the probes, Near-Earth 
Asteroid Scout (NEA Scout) will unspool 
a thin solar sail the size of a racquetball 
court. Propelled by photons, it will navigate 
to 2020GE, a miniature asteroid between 
5 and 15 meters across. About 2 years from 
now, it should sail as close as 800 meters to 
the asteroid in a 3-hour flyby. Many larger 
asteroids are loosely bound rubble piles, 
but NEA Scout will test the expectation that 
the faint pressure of sunlight has spun up 
2020GE too fast for it to hold any rubble, 
says Julie Castillo-Rogez, NEA Scout’s sci- 
ence principal investigator at NASA’s Jet 
Propulsion Laboratory. 

BioSentinel, led by Sergio Santa Maria, a 
biologist at NASA’s Ames Research Center, 
will carry out NASA’s first test of the bio- 
logical effects of radiation beyond low-Earth 
orbit since the last Apollo mission in 1972. Un- 
protected by Earth’s magnetic field, organisms 
are more vulnerable to DNA damage caused 
by solar outbursts and galactic cosmic rays. 
BioSentinel will carry strains of yeast in hun- 
dreds of microscopic wells to a Sun-orbiting 
perch beyond the Moon. There, its optical 
sensors will gauge the health of the yeast 
strains as they accumulate radiation damage 
by measuring cell growth and metabolism. 

BioSentinel, NEA Scout, and three other 
CubeSats were allowed to recharge their 
batteries during their long wait aboard 
the SLS. But five others were out of luck, 
including both LunaH Map and Lunar 
IceCube. Some could not be recharged 
without removing them from the rocket; 
in other cases NASA engineers feared the 
process might spark discharges that could 
harm the rest of the rocket. “We have to be 
very cognizant of the risk to the primary 
mission when we interface with these 
CubeSats,’ says Jacob Bleacher, NASA’s 
chief exploration scientist. 

Hardgrove says LunaH Map’s battery re- 
serve is probably at 50%; at 40% it will not 
be able to run through initial operations 
and maneuvers before the solar panels can 
unfurl and begin to recharge the batteries. 
He pushed hard for the opportunity to re- 
charge but says he was rebuffed by NASA. 
“You can’t agree to take stowaways and 
then kill them,” he says. Still, Hardgrove is 
resigned to rolling the dice. “It wouldn’t be 
a CubeSat mission if you weren’t anxious.” 
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China rises to first place in one 
key metric of research impact 


Other methods still put the United States somewhat ahead 


By Jeffrey Brainard and Dennis Normile 


or the first time, China has slightly 
edged out the United States in the 
number of most cited papers, a key 
measure of research impact, accord- 
ing to a Japanese science policy in- 
stitute. The milestone provides fresh 
evidence that China’s scholarship, known 
for its burgeoning quantity, is catching 
up in quality as well. “People are writing 
off China, [saying] they’re putting out a 
lot of stuff but it’s not good quality,” says 
Caroline Wagner, who studies science pol- 
icy and innovation at Ohio State Univer- 
sity, Columbus. “That’s just short-sighted.” 

Scholars disagree about the best meth- 
odology for measuring publications’ im- 
pact, however, and other metrics suggest 
the United States is still ahead—but barely. 

For the new report, Ja- 
pan’s National Institute of 
Science and Technology Pol- 
icy (NISTEP) tallied the top 
1% papers in terms of cita- 
tions, a rarified stratum inhab- 
ited by many Nobel laureates. 
Many such elite articles have 
authors from multiple coun- 
tries, however, which com- 
plicates the analysis. In one 
study, NISTEP used a method 
called “fractional counting” 
to divide the credit. If, for ex- 
ample, one French and three Swedish institu- 
tions contributed to a paper, France received 
25% of the credit and Sweden 75%. 

Using that measure, China accounted for 
27.2% of the most cited papers published in 
2018, 2019, and 2020, and the United States 
for 24.9%. Next was the United Kingdom, 
with 5.5%; Japan was in 10th place. (U.S. 
researchers were still slightly ahead when 
NISTEP used a less fine-grained method that 
credits every country that contributed to a 
highly cited paper equally, regardless of how 
many of its institutions were involved.) 

Cao Cong, a science policy scholar at 
the University of Nottingham’s campus in 
Ningbo, China, says the methodology may 
overstate China’s contributions to interna- 
tionally co-authored papers. “The question 
is who—the Chinese or their international 


“The question is 
who-the Chinese or 
their international 
collaborators 
led the [high-impact] 
studies.” 


Cao Cong, 
University of Nottingham, Ningbo 


collaborators—led the studies,” he says. 

Still, China’s rising production of top- 
cited papers is “remarkable,” NISTEP says; 
2 decades ago it only ranked 13th in the frac- 
tional counting metric. In recent years, crit- 
ics faulted the quality of Chinese research, 
pointing to policies—now being phased 
out—that provided professional rewards for 
authors based on the sheer number of papers 
published. They also noted that China-based 
paper mills, which provide researchers with 
authorship slots in exchange for money, ap- 
pear to be growing in number. 

Other measures still put the United 
States ahead. The State of U.S. Science and 
Engineering 2022, a report published by the 
U.S. National Science Foundation (NSF) in 
January, addressed one problem with re- 
ports such as NISTEP’s: By counting num- 
bers of highly cited papers, they favor big 
countries and those that 
spend more on research, 
just as big countries tend 
to do better in Olympic 
medal rankings. NSF in- 
stead measured the share 
of each country’s papers 
that are highly cited, which 
allows for comparisons 
across countries regardless 
of how much they publish. 
Its analysis showed U.S. 
papers were highly influ- 
ential: Of those published 
in 2018, more than twice as many ended 
up among the 1% most cited papers as ex- 
pected based on the country’s total output. 
China published 20% more top-cited papers 
than would be expected. 

Another paper, published by Wagner and 
others in 2020, concluded China’s research is 
slightly more innovative than the world aver- 
age. That study tracked how often papers’ ref- 
erence lists included atypical combinations 
of journals in disparate fields as a proxy for 
innovative new ideas. 

The impact of publications is just one mea- 
sure of a country’s scientific prowess, how- 
ever. The United States still leads in other 
indicators, such as the number of doctorates 
awarded—but there’s little doubt China’s sci- 
entific enterprise is catching up with the rest 
of the world at an unprecedented speed. 
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New law’s big payout for farming 
has uncertain climate payoff 


Measures to capture carbon in soil may be less effective 


than hoped, scientists say 


By Erik Stokstad 


hen settlers plowed the North 
American prairie, they uncovered 
some of the most fertile soil in 
the world. But tilling those deep- 
rooted grasslands released mas- 
sive amounts of underground 
carbon into the atmosphere. More green- 
house gases wafted into the skies when 
wetlands were drained and forests cleared 
for fields. Land conversion continues today, 
and synthetic fertilizer, diesel-hungry farm 
machinery, and methane-belching livestock 
add to the climate effects; all told, farming 
generates 10% of climate-affecting emis- 
sions from the United States each year. Now, 
Congress would like to turn back the clock 
and return some of that carbon to the soil. 
The Inflation Reduction Act, a broad 
bill signed into law this week, has historic 
climate provisions, including massive sub- 
sidies for clean power and electric vehicles 
(Science, 5 August, p. 564). But lawmakers 
also included more than $25 billion to ex- 
pand and safeguard forests and promote 
farming practices thought to be climate 
friendly. Those include no-till agriculture 
and “cover crops,” plants cultivated simply 
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to protect the soil. Researchers, environ- 
mental groups, and the farm industry agree 
that paying and training farmers to adopt 
those measures will improve soil health and 
water and air quality. “I think pretty much 
everyone across the board is pretty happy,” 
says Haley Leslie-Bole, a climate policy ana- 
lyst with the World Resources Institute. But 
how much these practices will slow global 
warming is unclear. 

“It’s probably going to be positive, but 
how positive we don’t really know yet,” says 
Jonathan Sanderman, a soil scientist at the 
Woodwell Climate Research Center. A ma- 
jor factor is whether the U.S. Department 
of Agriculture (USDA) spends the money 
on the practices most likely to have climate 
benefits. Another challenge is measuring 
and quantifying the reductions, a task com- 
plicated by the great diversity of U.S. land 
and farming practices and the complex bio- 
geochemistry of the carbon cycle. 

Sanderman and others think bigger cli- 
mate gains could come from other changes 
in farming, such as lessening emissions 
from fertilizer and livestock. And climate 
and agriculture expert Tim Searchinger of 
Princeton University sees a need for more 
research on climate-beneficial farming 


After harvest, a Maryland cornfield is planted with 
a cover crop—a practice that boosts soil carbon. 


practices. USDA “needs to come up with an 
ambitious, creative plan and include a re- 
ally good system for tracking progress.” 

U.S. farmers have long received payments 
for conserving soil and reducing the pollu- 
tion that can run off their fields. The new 
bill expands funding for those programs, al- 
lowing more farmers to get a per-acre pay- 
ment for a wide range of activities expected 
to reduce or sequester carbon emissions, 
including no-till and cover crops. 

Yet how those practices will affect the 
soil’s ability to accumulate and hold car- 
bon depends on a complex interplay be- 
tween plants, which add organic material 
to the soil, and microbes and other tiny 
organisms that break down everything 
they can, sometimes releasing greenhouse 
gases. Those interactions vary enormously 
with soil type and environmental condi- 
tions, and it can take years for the impact 
of changes in crop types or farming tech- 
niques to emerge. 

Planting cover crops such as cowpeas and 
vetch, for example, boosts soil carbon on av- 
erage each year by one-third of a ton per 
hectare, according to a 2015 meta-analysis 
in Agriculture, Ecosystems & Environment. 
The cover crops help control weeds and 
make the soil more porous, says Cristine 
Morgan, scientific director of the Soil Health 
Institute. “Grab a shovel and jump on it,” 
she says. “And the shovel goes deeper.” Yet 
the net climate benefits remain uncertain. 
On the one hand, leguminous cover crops 
such as cowpeas and vetch add nitrogen 
to the soil, which microbes can transform 
into nitrous oxide, a potent greenhouse gas. 
But farmers might then add less fertilizer, 
which would reduce emissions. 

Another reputedly climate-friendly prac- 
tice is no-till farming, which is growing in 
popularity. By not plowing fields, farmers 
protect the topsoil from erosion. They also 
save on diesel, which benefits their bottom 
line and reduces carbon dioxide emissions. 
Crop yields can increase as carbon accumu- 
lates in the upper root zone, enriching the 
soil and helping it retain moisture. 

At the same time, studies have shown that 
carbon decreases in deeper layers of certain 
soils, making the net change small or zero. 
And during the first decade, no-till fields 
release higher levels of nitrous oxide as mi- 
crobes break down crop residues, although 
the levels eventually decline. A further com- 
plication is permanence: Few U.S. farm- 
ers continue the practice uninterrupted. 
Factors such as the growth of herbicide- 
resistant weeds and the need to rotate 
crops eventually convince them to plow 
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again, which can allow nearly one-third 
of the accumulated carbon to escape into 
the atmosphere. 

Still, Keith Paustian, a soil scientist at 
Colorado State University, and others argue 
that together, these regenerative soil prac- 
tices are powerful climate tools. “We can 
regain potentially quite a fair amount of 
that lost carbon,” he says, by combining no- 
till and cover crops with other steps, such 
as planting trees on farms and improving 
grazing management to revitalize pastures. 
“We need to get agriculture to a new para- 
digm where these kinds of practices are the 
norm and not the exception,” he says. 

In addition to fields and forests, the bill’s 
climate-related provisions target livestock, 
which emit methane that accounts for 27% 
of all greenhouse emissions from U.S. ag- 
riculture. As part of one USDA program 
awarded $8.5 billion, the bill instructs the 
agency to prioritize projects that will reduce 
these emissions from cows and other rumi- 
nants by modifying their feed. An additive 
called 3-NOP can cut methane burped by 
dairy cattle by about 30% and is used in the 
United Kingdom and Europe. But it hasn’t 
been greenlighted yet by the U.S. Food and 
Drug Administration, which considers it a 
drug, slowing the approval process. 

Searchinger and others hope USDA will 
also emphasize the incentives the bill pro- 
vides to cut fertilizer use. Nitrification in- 
hibitors, compounds that slow microbial 
conversion of fertilizer into nitrous oxide, 
can help farmers apply less and also cut 
emissions, he and others say. Farmers may 
be receptive because of fertilizer short- 
ages and high prices. USDA this week 
said it will streamline applications for 
fertilizer management. 

Many climate advocates say the most cost- 
effective way to help the climate through 
agriculture is simply to farm less land and 
raise less livestock. That means persuading 
farmers not to convert grasslands or other 
carbon-rich lands to row crops such as corn 
and soybeans. But the bill includes no addi- 
tional funding for USDA’s main program for 
protecting sensitive private land, the Con- 
servation Reserve Program. 

What might it all add up to? Paustian 
admits it’s hard to pin down the total cli- 
mate benefit of the $25 billion. But the bill 
also includes a special pot of $300 mil- 
lion for USDA and partners to collect field 
data on carbon sequestered and emissions 
reduced—data could help target future cli- 
mate efforts more effectively, he and others 
say. “That’s really significant,” says Alison 
Eagle, an agricultural economist with the 
Environmental Defense Fund. “This invest- 
ment can help direct the next set of funding 
to the right place.” 
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Massive climate and health bill 
also benefits U.S. energy labs 


Major physics projects and national lab infrastructure will 


get $1.5 billion over 5 years 


By Jeffrey Mervis and Adrian Cho 


n ambitious climate, health care, and 

tax reform bill passed last week by the 

U.S. Congress also contains a 5-year, 

$2 billion windfall for the national 

laboratories run by the U.S. Depart- 

ment of Energy (DOE). Most of the 
money—$1.5 billion—will go to DOE’s sci- 
ence office for new facilities and upgrades at 
the 10 national labs it oversees, which carry 
out basic research in an array of fields. 

“This is a nice little bump for the science 
budget,” says Bill Madia, former director of 
two DOE national laboratories. “I’m sur- 
prised science got anything in this bill.” 
Leland Cogliani of Lewis-Burke Associates, 
a science lobbying firm, cautions that the 
extra money is only a down payment for 
many projects in the pipe- 
line. “It may sound like a lot 
of money, but $1.5 billion 
doesn’t really go very far,” 
he says. 

The new spending, part 


“Tm surprised 
science got 
anything in this bill.” 


of heart that allowed the Democrats to pre- 
vail in the Senate on 7 August, with Vice 
President Kamala Harris breaking a 50-50 
tie. On 12 August, the House of Representa- 
tives approved the measure on a party-line 
vote of 220 to 207, and Biden signed it into 
law on 16 August. 

Despite his concerns about additional 
federal spending, Manchin has repeatedly 
sought to boost support for DOE’s network 
of 17 national labs, including the National 
Energy Technology Laboratory in his home 
state. That lab is funded through DOE’s 
Office of Fossil Energy and Carbon Man- 
agement, which in the IRA would receive 
$150 million over 5 years. 

The House’s original version of Build 
Back Better spelled out how much DOE 
should spend on each of dozens of spe- 
cific projects. In contrast, 
the IRA simply divides the 
$1.5 billion among the of- 
fice’s seven research pro- 
grams and leaves other 
decisions to DOE officials, 


of the Inflation Reduc- Bill Madia, an approach Madia ap- 
tion Act (IRA) of 2022, is former Department of Energy plauds because it enables 
much less than President lab director DOE to be selective. 


Joe Biden had envisioned 

last year in proposing a $3.2 trillion blue- 
print for sweeping social, environmental, 
and economic reforms. That ambitious 
agenda, called the Build Back Better Act, 
included nearly $13 billion for DOE’s Office 
of Science and tens of billions of dollars for 
other federal research agencies. 

Those numbers shrank as the legisla- 
tion went through many iterations and 
names. But the IRA still delivers a sizable 
investment in DOE lab infrastructure de- 
spite the removal of funding for the other 
research agencies. 

The IRA, which offsets some $370 billion 
in additional spending with revenue from 
an array of taxes, was the result of pro- 
tracted negotiations between Senate Ma- 
jority Leader Chuck Schumer (D-NY) and 
Senator Joe Manchin (D-WV). Manchin’s 
opposition to climate and spending provi- 
sions in Biden’s original plan and a subse- 
quent $2 trillion version was sufficient to 
kill both bills. But Manchin had a change 


“The specific spend plan 
is being finalized internally in the Office 
of Science,” says Harriet Kung, the agency’s 
deputy director for science programs. But 
it’s not hard to guess where much of the 
money is likely headed. 

High energy physicists at Fermi Na- 
tional Accelerator Laboratory in Batavia, 
Illinois—DOE’s only high energy physics 
lab—are gearing up to build a massive neu- 
trino experiment. Its price tag has doubled 
to $3 billion, so it could certainly use the 
help. Similarly, the extra money could kick- 
start a proposed Electron-Ion Collider to 
be built at Brookhaven National Labora- 
tory in Upton, New York, at a cost of be- 
tween $1.7 billion and $2.8 billion. 

DOE will prioritize “signature facility 
projects” to advance schedules and reduce 
risks, Kung says, and will favor projects 
nearing completion. It will also consult 
with the White House and Congress, Kung 
says. “All of these approvals are necessary 
before any funding will be allocated.” 
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Scientists isolate the genes 
that shape the human pelvis 


Embryonic tissue samples reveal how pelvis shape— 
primed for bipedalism—comes to life 


By Michael Price 


he wide, basin-shaped human pelvis 

is a defining physical feature of our 

species. Without it, we couldn’t walk 

upright or give birth to big-brained 

babies. Now, a new study of human 

embryos has pinpointed the window 
in embryonic development during which 
the pelvis begins to look humanlike and 
identified hundreds of genes and regula- 
tory RNA regions that drive this trans- 
formation. Many bear the hallmarks of 
strong natural selection for bipedalism, the 
authors conclude. 

“Tt is a really impressive study, especially 
the genomic part, which uses all the bells 
and whistles of state-of-the-art-analysis,” says 
Marcia Ponce de Leon, an anthropologist at 
the University of Ziirich (UZH) who was not 
involved with the work, which was reported 
this week in Science Advances. The results, 
she adds, support the idea that evolution of- 
ten produces new physical features by acting 
on genetic switches that affect early embry- 
onic development. Such predictions are “easy 
to state but very difficult to actually dem- 
onstrate, and this is what the authors did,” 
she says. 

The pelvic girdle in primates consists 
of three major parts: blade-shaped bones, 
called ilia, that fan out to form the hips and, 
below those, two tube-shaped fused bones 
known as the pubis and ischium, which give 
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shape to the birth canal. Great apes have 
relatively elongated ilia that lie flat against 
the back of the animals, as well as relatively 
narrow birth canals. Humans have shorter, 
rounded ilia that flare out and curve around. 
The reshaped ilia provide attachment points 
for the muscles that make upright walking 
more stable, and a wider birth canal accom- 
modates our big-brained babies. Terence 
Capellini, an evolutionary biologist at Har- 
vard University, says those pelvic patterns 
were already emerging in early human ances- 
tors such as the 4.4-million-year-old hominin 
Ardipithecus ramidus, which had slightly 
turned-out ilia and is thought to have at least 
occasionally walked on two feet. 

When and how those features take shape 
in human gestation had been mysterious, 
however. Many of the key human pelvic fea- 
tures, such as its curved, basinlike shape, 
are already developed by week 29. But 
Capellini wondered whether they might 
emerge earlier, when the pelvis has not yet 
turned to bone, but instead has scaffolding 
made from cartilage. 

With the consent of women who had legally 
terminated their pregnancies, the researchers 
examined 4- to 12-week-old embryos under a 
microscope. They found that roughly around 
the 6- to 8-week mark, the ilium begins to 
form and then rotates into its telltale basin- 
like shape. Even as other cartilage within the 
embryo starts to ossify into bone, Capellini’s 
team found this cartilage stage in the pelvis 


Compared with gorilla and chimp pelvises (left 
and middle), the human pelvis is curved and rotates 
out to the side to support upright walking. 


seems to persist for several more weeks, giv- 
ing the developing structure more time to 
curve and rotate. “These aren’t bones, this 
is cartilage that is growing and expanding 
and taking that shape,” Capellini says. 

Next, the researchers extracted RNA from 
different regions of the embryos’ pelvises 
to see which genes were active at different 
developmental stages. Then, they identi- 
fied hundreds of human genes within spe- 
cific pelvic sections whose activity seemed 
turned up or down during the first trimes- 
ter. Of these, 261 genes were in the ilium. 
Many of the downregulated genes are in- 
volved in turning cartilage to bone, whereas 
many of the upregulated genes maintain 
cartilage, Capellini says, and possibly act 
to keep the ilium in a cartilaginous stage 
for longer. 

By comparing the developing pelvis’ ge- 
netic activity with a mouse model’s, the 
researchers also identified thousands of 
genetic on/off switches seemingly involved 
in shaping the human pelvis. Stretches of 
DNA within those switches appear to have 
evolved rapidly since our species’ split from 
our common ancestor with chimpanzees. 
But among modern humans, those regula- 
tory bits in the ilium show strikingly little 
variation. That uniformity, the researchers 
say, is a sign that natural selection put—and 
continues to put—intense pressure on the 
ilium to develop in a highly specific way. 

“We think this is really pointing to the 
origins of bipedalism in our genome,’ 
Capellini says of his team’s work. 

Martin Hausler, an anthropologist at 
UZH, says he’s not surprised at the evidence 
for intense selective pressure on the pelvis, 
but adds that the findings offer an impres- 
sive look into some of the pelvic changes “at 
the very origin of what makes us human.” 
Future work comparing human embryos 
with other primate embryos—rather than 
mouse models—would allow for an even 
better look at how natural selection re- 
shaped the human pelvis, he adds. 

The emerging understanding could also 
help scientists devise treatments for hip joint 
disorders or predict complications in child- 
birth, says Nicole Webb, a paleoanthropo- 
logist who studies pelvic anatomy at the 
University of Tiibingen. Deviations from 
the genetic program that Capellini’s team 
identified can result in disorders such as 
hip dysplasia and hip osteoarthritis, she 
notes. “I hope that it has major implications 
for making people’s lives better. That would 
be huge, to connect paleoanthropology with 
real life,’ Webb says. 
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COVID-19 


Long Covid clues emerge from patients’ blood 


Study implicates lack of key hormone, battle-weary immune cells, and awakened viruses 


By Jennifer Couzin-Frankel 


n ambitious study of people with 

Long Covid, the mysterious, disabling 

symptoms that can trail a SARS-CoV-2 

infection, has turned up a host of ab- 

normalities in their blood. The clues 

add to a body of evidence hinting at 
drivers of the condition and potential treat- 
ments worth testing. They also suggest that, 
as many scientists and patients have sus- 
pected, Long Covid shares certain features 
with myalgic encephalomyelitis/chronic fa- 
tigue syndrome (ME/CFS), another condition 
thought to follow an infection. 

The new study, posted as a preprint last 
week, was modest in size, examining just 
99 people with Long Covid. “But it went very 
deep, it went into granular aspects of the T 
cells, the antibody response,” says Eric Topol, 
director of the Scripps Research Transla- 
tional Institute, who was not involved in the 
work. “This is exploratory, but it’s the founda- 
tion for much bigger studies.” 

The Long Covid patients, most of them 
struggling with intense fatigue, brain fog, 
and other symptoms, had low levels of cor- 
tisol, a stress hormone that helps the body 
control inflammation, glucose, sleep cycles, 
and more. Features of their T cells in- 
dicated their immune system was 
battling unidentified invaders, 
perhaps a reservoir of SARS- 
CoV-2 or a reactivated pathogen 
such as Epstein-Barr virus. 

Other groups studying Long 
Covid patients have reported 
similar results this year, includ- 
ing in a January Cell paper that 
documented low cortisol in those 
with long-lived respiratory symp- 
toms, and reactivation of viruses in 
patients with neurological issues. 
Collectively, these data “make me 
think about what other drugs we can 
test,’ such as virus-directed antibod- 
ies or targeted anti-inflammatories 
to tame the immune system, says 
Emma Wall at University College 
London and the Francis Crick In- 
stitute, who co-leads a large trial of 
potential Long Covid therapies. 

The new Long Covid project be- 
gan in late 2020, when Yale Univer- 
sity immunologist Akiko Iwasaki 
teamed up with David Putrino, a 
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neurophysiologist at the Icahn School of 
Medicine at Mount Sinai who was caring 
for affected patients. The pair wanted to 
compare those patients with people who 
had never been infected—and those who 
had recovered. To Putrino’s surprise, “It was 
quite challenging to find people who were 
fully recovered from COVID.’ Many post- 
COVID-19 volunteers described themselves 
as healthy but then admitted, for example, 
that their once-normal gym workouts were 
too exhausting to resume. In the end, the 
team signed on 39 COVID-19-recovered vol- 
unteers among a total of 116 controls. 

The low cortisol levels in the Long Covid 
patients, about half of normal levels, aren’t 
a total surprise: Symptoms such as fatigue 
and muscle weakness are associated with 
less of the hormone. The cause remains a 
mystery. ACTH, a hormone made by the pi- 
tuitary gland that controls cortisol produc- 
tion, was at normal levels in the Long Covid 
group. Furthermore, note Putrino and oth- 
ers, some Long Covid patients outside the 
study have tried short courses of steroids, 
which can treat low cortisol, but say they 
haven’t helped. Next, the researchers plan 
to track cortisol levels throughout the day in 
Long Covid; the steroid rises and 
falls on a daily cycle, and the 


initial research only tested it in the morning. 

The Long Covid blood samples were also 
awash with a category of “exhausted” T cells 
that can be recognized by certain markers 
they express. Such cells surge in the ongo- 
ing presence of pathogens—suggesting “the 
bodies of people with Long Covid are actively 
fighting something,” Putrino says. 

This battle would produce chronic inflam- 
mation, which matches many Long Covid 
symptoms. By measuring levels of antibodies 
against viral proteins released in the blood, 
the study also noted reactivation of Epstein- 
Barr virus and other herpesviruses whose 
genes can sit dormant inside infected cells 
for extended periods. Iwasaki was intrigued 
to learn that the degree of T cell exhaustion 
appeared to track with Epstein-Barr virus re- 
activation, though she doesn’t consider that 
virus the only potential culprit. SARS-CoV-2 
may linger in Long Covid patients, too, she 
and others say. Epstein-Barr reactivation, low 
cortisol, and T cell exhaustion have all turned 
up in some ME/CFS patients. 

Long Covid is far from uniform, the new 
study makes clear—for example, only about 
20% to 30% of the study’s patients had very 
high levels of exhausted T cells. But, “The 
level of consistency is great” among recent 
studies probing Long Covid biology, says 

James Heath, president of the Institute 
for Systems Biology, an author of the Cell 
paper that found low cortisol and virus re- 
activation. He notes that his group’s study 
examined patients about 3 months after 
SARS-CoV-2 infection, whereas Iwasaki 
and Putrino’s cohort was on average more 
than a year out from their COVID-19. 

Putrino and Iwasaki say it’s time to 
forge ahead with new trials of potential 
therapies, which could also elucidate Long 
Covid’s causes and whether subsets of pa- 
tients are more likely to respond to cer- 
tain interventions. Iwasaki’s experimental 
therapy wish list is long and includes corti- 
sol supplementation; Epstein-Barr virus- 
targeting therapy; the antiviral drug Pax- 
lovid, now used for acute COVID-19; and 
even therapies that deplete B cells, which 
are used to treat autoimmune disease and 
calm the immune system. 

“We should be trying these right now,” 
Iwasaki says. “As a basic scientist, of 
course I’d like to have all the pieces of the 
puzzle” before launching trials. “But the 
patients, they can’t wait.” 
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ANYWHERE BUT H 


E 


China now insists the pandemic didn’t start within its borders. 
Its scientists are publishing a flurry of papers pointing the finger elsewhere 


hen Alice Hughes down- 
loaded a preprint from the 
server Research Square in 
September 2021, she could 
hardly believe her eyes. The 
study described a massive ef- 
fort to survey bat viruses in 
China, in search of clues to 
the origins of the COVID-19 
pandemic. A team of 21 researchers from 
the country’s leading academic institu- 
tions had trapped more than 17,000 bats, 
from the subtropical south to the frigid 


By Jon Cohen 


northeast, and tested them for relatives of 
SARS-CoV-2. 

The number they found: zero. 

The authors acknowledged this was a sur- 
prising result. But they concluded relatives 
of SARS-CoV-2 are “extremely rare” in China 
and suggested that to pinpoint the pandem- 
ic’s roots, “extensive” bat surveys should take 
place abroad, in the Indochina Peninsula. 

“T don’t believe it for a second, says 
Hughes, a conservation biologist who’s now 


at Hong Kong University. Between May 2019 
and November 2020, she had done her own 
survey of 342 bats in the Xishuangbanna 
Tropical Botanical Garden, a branch of the 
Chinese Academy of Sciences (CAS) in Yun- 
nan province where she worked at the time. 
As her team reported in Cell in June 2021, it 
found four viruses related to SARS-CoV-2 in 
the garden, which is about three times the 
size of New York City’s Central Park. 

The new study had sampled bats near 
that same location, at an abandoned mine 
that had yielded another close SARS-CoV-2 
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China has rejected calls for 
independent probes of the 
Wuhan Institute of Virology and 
dismissed the theory that 
SARS-CoV-2 leaked from its labs. 


relative in 2013, and at other sites in nearly 
half of China’s 31 provinces. And yet the 
only thing researchers found were viruses 
close to SARS-CoV-1, which caused the out- 
break of severe acute respiratory syndrome 
2 decades ago. 

Edward Holmes, an evolutionary bio- 
logist at the University of Sydney who co- 
authored the Cell paper with Hughes, dis- 
misses the preprint with a single word: 
“Bullshit.” Although Holmes has no evidence 
the team behind the study did anything 
underhanded, “There is a big contradiction 
between this study and others that needs to 
be explained,” he says. 

But the paper meshed with a growing po- 
litical reality in China. From the start of the 
pandemic, the Chinese government—like 
many foreign researchers—has vigorously 
rejected the idea that SARS-CoV-2 some- 
how originated in the Wuhan Institute of 
Virology (WIV) and escaped. But over the 
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Police closed the 
Huanan Seafood Market 
in Wuhan in January 
2020, shortly after the 
first cluster of COVID-19 
cases surfaced. 


past 2 years, it has also started to push back 
against what many regard as the only plau- 
sible alternative scenario: The pandemic 
started in China with a virus that naturally 
jumped from bats to an “intermediate” spe- 
cies and then to humans—most likely at the 
Huanan Seafood Market in Wuhan. 

Beijing was open to the idea at first. But 
today it points to myriad ways SARS-CoV-2 
could have arrived in Wuhan from abroad, 
borne by contaminated frozen food or in- 
fected foreigners—perhaps at the Military 
World Games in Wuhan, in October 2019— 
or released accidentally by a U.S. military 
lab located more than 12,000 kilometers 
from Wuhan. Its goal is to avoid being 
blamed for the pandemic in any way, says 
Filippa Lentzos, a sociologist at King’s Col- 
lege London who studies biological threats 
and health security. “China just doesn’t 
want to look bad,’ she says. “They need to 
maintain an image of control and compe- 


tence. And that is what goes through every- 
thing they do.” 

The idea of a pandemic origin outside 
China is preposterous to many scientists, re- 
gardless of their position on whether the vi- 
rus started with a lab leak or a natural jump 
from animals. There’s simply no way SARS- 
CoV-2 could have come from some foreign 
place to Wuhan and triggered an explosive 
outbreak there without first racing through 
humans at the site of its origin. “The idea that 
the pandemic didn’t originate in China is in- 
consistent with so many other things,” says 
Jesse Bloom, a virologist at the Fred Hutchin- 
son Cancer Research Center who has argued 
for more intensive studies of the WIV lab- 
accident scenario. “When you eliminate the 
absurd, it’s Wuhan,” says virologist Gregory 
Towers of University College London, who 
leans toward a natural origin. 

Yet Chinese researchers have published a 
flurry of papers supporting their government’s 
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“anywhere-but-here” position. Multiple stud- 
ies report finding no signs of SARS-CoV-2- 
related viruses or antibodies in bats and 
other wild and captive animals in China. 
Others offer clues that the virus hitched a 
ride to China on imported food or its packag- 
ing. On the flip side, Chinese researchers are 
not pursuing—or at least not publishing— 
obvious efforts to trace the sources of the 
mammals sold at the Huanan market, which 
could yield clues to the virus’ origins. 

Wu Zhiqiang, a virologist at Peking Union 
Medical College who is the lead author of 
the Research Square preprint, wouldn’t say 
whether the government vetted his study, 
which is still under review at a journal. But 
he denies that his results were influenced by 
Beijing, let alone cooked up to please it. “We 
do not represent the government or anyone 
else,” writes Wu, who says his team is not 
arguing that the pandemic originated 
outside China. “We just stand by our 
data, and all our data are open access.” 

Yet Holmes, who has close and long- 
standing ties with Chinese scientists, 
worries they have become entangled 
in the government’s political messag- 
ing. “There’s clearly an official narra- 
tive that’s come down from on high 
that appears to be: “The investigations 
have been completed now, and they 
couldn’t find where it came from,” Holmes 
says. “It’s no longer China’s problem— 
this is about a problem from elsewhere.” 
He says his collaborators in China have 
become increasingly guarded, wary of con- 
tradicting that narrative with their own 
findings and opinions. 

Hughes left Xishuangbanna in December 
2021 after 9 years in China because it had 
become too difficult to continue her bat re- 
search there. Most origin-related research re- 
quired government review, she says, and she 
also needed a green light from CAS. “The dia- 
logue is just: ‘We don’t want it to have come 
from here in any form,” she says. “And that’s 
not a sensible way of doing science.” 

Science attempted to discuss these is- 
sues with George Gao, who until last month 
headed the Chinese Center for Disease Con- 
trol and Prevention (CCDC) and is a lead 
author on several key papers about the pan- 
demic’s origins. “What can I say?” Gao texted 
back. “Best wishes.” 


THE NOTION THAT it all began at the sprawl- 
ing Huanan market in downtown Wuhan 
was not controversial at first. A 31 Decem- 
ber 2019 report from the Wuhan Municipal 
Health Commission announced a cluster of 
27 unexplained pneumonia cases linked to 
the market, which was immediately closed. 
Three weeks later, after a novel corona- 
virus had been identified as the cause, 
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government researchers concluded in the 
China CDC Weekly that “all current evi- 
dence points to wild animals sold illegally 
in the Huanan Seafood Wholesale Market.” 
On 27 January, the official Chinese news 
agency Xinhua reported that 33 “envi- 
ronmental samples” from the market had 
tested positive for the virus and all but two 
came from areas selling wildlife. Again, 
the results suggested “the virus stems from 
wild animals on sale at the market,’ the 
article stated. 

That spring and summer, origin research 
continued apace in China, Hughes says. Her 
group dug into bat samples it had collected 
in 2019 at the Xishuangbanna gardens 
and discovered one of the closest relatives 
of SARS-CoV-2 yet found, which shared 
93.3% of its genetic sequence, a finding it 
published in June 2020 in Current Biology. 


The dialogue is just: “We 
don’t want it to have come 
from here in any form.’ 


Alice Hughes, Hong Kong University 


“Throughout 2020, we were encouraged to 
do more work,” Hughes says. 

But the Chinese government’s counter- 
narrative had started to take shape. By the 
end of January, published reports about the 
earliest cases suggested almost half had no 
connection to Huanan, raising doubts about 
its role as the outbreak’s origin. The lab-leak 
theory had also started to circulate interna- 
tionally, and before long, then-U.S. Presi- 
dent Donald Trump began blaming WIV, 
which had studied bat coronaviruses for 
15 years, for sparking the “kung flu” pandemic 
with the “China virus.” In March 2020, Zhao 
Lijian, a spokesperson for China’s foreign 
ministry, flipped the tables, asking whether 
the virus came from a U.S. Army lab at Fort 
Detrick in Maryland. 

The Chinese government also began to 
clamp down on interactions between origin 
researchers and the media. Suddenly, giving 
an interview required permission from the 
Ministry of Science and Technology. “Part 
of that was for a good reason,’ Hughes says. 
“There was some very bad science going on, 
and they didn’t want that to come out.” 

The country’s new mindset colored a 
March 2021 report from an international 
team sent to China earlier in the year to study 
the origin of the pandemic under the auspices 
of the World Health Organization (WHO). 
The report deemed a laboratory accident “ex- 
tremely unlikely” but ranked the frozen food 


scenario as “possible.” A direct zoonotic spill- 
over ranked higher, as “possible-to-likely,’ 
yet the report questioned whether the jump 
could have happened at the Huanan market, 
claiming animals that might have carried the 
virus weren’t there. “No verified reports of 
live mammals being sold around 2019 were 
found,’ the report flatly stated. 

Dutch virologist Marion Koopmans, a 
prominent member of the WHO team, ac- 
knowledges politics played a role in both 
statements. Koopmans says she never put 
much stock in the frozen food route. As to 
the notion that mammals weren’t sold at the 
Huanan market, she says the international 
scientists on the mission were “highly skep- 
tical” of the claim. They even showed their 
Chinese counterparts a photo Holmes had 
taken a few years earlier at that very market 
that showed a caged raccoon dog—a spe- 

cies known to be highly susceptible to 
SARS-CoV-2 and able to transmit it. 

But in the end, hard evidence that 
the market sold mammals in 2019 was 
lacking, and the international scien- 
tists went along with the “no verified 
reports” phrasing, Koopmans says, to 
avoid “the political situation.” 

To Chris Newman, an ecologist at the 
University of Oxford who has long stud- 
ied the illegal sale of wildlife at Chinese 

markets, the report marked a significant shift 
in the Chinese government’s strategy. “They 
tried to start to change the narrative, and 
they’ve been sort of trapped in that ever since,” 
Newman says. “They’re trying to look for any 
alternative explanation, however implausible 
that might be. It’s almost like being caught in 
a childish lie.” 

As it happened, Newman knew the 
statement about the Huanan market was 
dead wrong. “Somebody was misleading 
the World Health Organization here,” he 
says. Newman had helped conceptualize 
and write a study led by Zhou Zhao-Min 
of China West Normal University that sur- 
veyed mammals sold in Wuhan markets 
for 2 years prior to the pandemic, looking 
for the source of a tick-borne infection. It 
documented nearly 50,000 mammals for 
sale at Wuhan’s animal markets, including 
Huanan. Among the species were raccoon 
dogs and civets, also highly vulnerable 
to SARS-CoV-2. 

The team could have published the study 
as a preprint, but it wanted the “credibility” 
of a peer-reviewed paper, Newman says. 
That took a long time. The paper finally ap- 
peared in Scientific Reports in June 2021— 
revealing the glaring misdirection in the 
WHO report published 3 months earlier. 
Zhou and Newman had sent a draft of their 
paper to WHO in October 2020, realizing its 
potential significance for the origins debate, 
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but the agency had not done anything with 
it. “That was a big glitch,” Koopmans says. 
“T find that annoying—I’m putting it mildly.” 

Zhou and the other Chinese co-author, 
Xiao Xiao, did not respond to repeated re- 
quests from Science for interviews about the 
study. “When the paper did finally come out, 
they were both taken aside by their institu- 
tions and told off;? Newman says. “They were 
told to cease all work on wildlife trade. And 
they were quite worried.” 

The paper left out a key detail that might 
help clarify the pandemic’s origin: It did not 
say whether the researchers collected ticks— 
which suck mammalian blood—or took bio- 
logical samples directly from the animals. 
Either could be tested for antibodies to SARS- 
CoV-2 or genetic remnants of the virus itself 
to find out which animal species harbored 
it. Newman wasn’t involved with the sample 
collection, so he asked Zhou, but to 
no avail. “Zhou is not at liberty to 
say, and he doesn’t reply,’ Newman 
says. “It’s so self-evident that surely 
somebody in the Chinese CDC has 
done exactly that. Surely somebody 
knows the answer.” 


EVERYONE FROM LAB-LEAK propo- 

nents to WHO Director-General 

Tedros Adhanom Ghebreyesus 

and U.S. President Joe Biden criticized 
the WHO report, and in its wake, China’s 
willingness to probe the origins of the pan- 
demic took another nosedive. The govern- 
ment scuttled ambitious follow-up plans 
proposed by the group: studies of viral 
genomes of the earliest cases, to trace the 
initial spread; retrospective analyses of Chi- 
nese hospital records prior to the outbreak 
and of illnesses in high-risk groups such as 
farmers and veterinarians; and mapping 
supply chains of farms that sold to the Hua- 
nan market. “The derailment of this whole 
process is horrible,” Koopmans says. 

The finger-pointing at other countries 
also ramped up. On 17 September 2021, 
The Lancet published a four-page corre- 
spondence by Wu and authors at CCDC and 
several of the country’s top academic insti- 
tutions claiming the WHO committee had 
carried out “an extensive investigation in 
Wuhan in the past year.” The letter argued it 
was now “reasonable” to start a “worldwide” 
hunt for the origin and laid out a plan for 
doing so. Invoking the lofty rhetoric com- 
mon in global health circles—“humankind 
must work together’—the article stressed 
that the massive global investigation it pro- 
posed “should be carried out by scientists 
on the basis of science alone, without inter- 
ference or coercion from political forces.” 

By now more than two dozen reports from 
Chinese scientists have suggested the virus 
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came from elsewhere. Many, including Wu’s 
Lancet letter, explored the imported frozen 
food hypothesis. After China had all but 
stopped COVID-19 transmission by March 
2020, researchers linked small outbreaks to 
imported salmon at a Beijing market, frozen 
cod offloaded at shipping docks in Qingdao, 
and imported pollock packaged by a com- 
pany in Dalian. Other scientists tested more 
than 50 million swabs of frozen food pack- 
ages from across China and found that nearly 
1500 had genetic remnants of the virus. In the 
China CDC Weekly they asserted it “cannot be 
ruled out” that just such contamination trig- 
gered the initial Huanan market outbreak. 
Yet researchers rarely detected infectious 
virus on the food—only viral RNA—and food 
regulators in the United States, Europe, Aus- 
tralia, and New Zealand have dismissed this 
route of transmission. There’s also nothing 


bad. They need to maintain an 
Filippa Lentzos, King’s College London 


that connects imported frozen food at the 
Huanan market to the outbreak there. 

Besides, if the virus arrived in Wuhan on 
frozen food, “it means the virus had to be al- 
ready circulating somewhere else, and there’s 
no evidence of that;’ Koopmans says. Stud- 
ies from Spain, Italy, France, Brazil, and the 
United States have reported finding genetic 
pieces of the virus—or antibodies to it—in 
stored tissue from patients or wastewater 
samples that predate the Wuhan outbreak. 
Chinese scientific papers and media have 
made much of such findings, but skeptics say 
the viral traces may be contaminants and the 
antibodies could be responses to other patho- 
gens that “cross-react” with SARS-CoV-2. 

As for the frozen food route, Holmes says 
it’s not even worth discussing. “You might as 
well just say it came from cosmic dust.” 


CHINA HAS NOT allowed foreign researchers 
into the country to conduct independent 
origin studies, but that hasn’t stymied all 
investigations. On 26 February, a large in- 
ternational team that included Holmes and 
Michael Worobey, an evolutionary biologist 
at the University of Arizona, published two 
preprints that offered a new analysis of the 
outbreak’s early days—and pointed directly at 
the Huanan market. (Science published peer- 
reviewed versions online on 26 July.) 

Early reports suggesting some cases were 
not connected to the market mistakenly iden- 


China just doesn’t want to look 


image of control and competence. 


tified people as having the disease who did 
not, the authors contended. They combined 
spatial analyses of where infected people 
lived and worked, genetic sequencing of early 
viral samples, and the location of positive en- 
vironmental samples and the stalls that sold 
mammals highly susceptible to SARS-CoV-2 
to conclude the market was the “epicenter” 
of the pandemic. 

But a preprint from China published a 
day earlier by Gao and many co-authors 
from CCDC reached a starkly different con- 
clusion. The study analyzed more than 1000 
environmental samples from the Huanan 
market and found SARS-CoV-2 in 73 of them, 
indicating the virus was in the sewer, on the 
ground, and in “containers,” some of which 
Worobey suspects may have been cages that 
held mammals, based on their location. 

In a so-called metagenomic analysis, the 
researchers also found several 
samples had a blend of viral and 
human RNA, “which highly sug- 
gests the SARS-CoV-2 might have 
derived from Homo sapiens” in the 
market, the authors wrote. In other 
words, humans didn’t catch SARS- 
CoV-2 from animals at the market. 
Instead, people might have brought 
it to the market from elsewhere. 

Again, the researchers men- 
tioned viral traces on imported frozen food 
and the retrospective studies from other 
countries as clues to a foreign origin. “Defi- 
nitely, more work involving international co- 
ordination is needed to investigate the real 
origins of SARS-CoV-2,” they concluded. 

Worobey and Holmes were gobsmacked by 
what the study didn’t say. In a graphic that 
illustrates the metagenomic analysis, dots 
show the virus mixed with RNA from several 
species other than humans, but the preprint 
does not specify which species. “They ignore 
all the other animals in there,’ Holmes says. 

A researcher who claims to have reviewed 
the manuscript for Nature says it’s not clear 
whether the omission was intentional or due 
to sloppy science, but either way, “I said you 
cannot publish this unless they release the 
raw data.” The reviewer, who asked not to be 
named, noted that the paper also includes 
an analysis of a sample from a defeather- 
ing machine at the market that found only 
human DNA. “Either only humans were de- 
feathered or the analysis was wrong—pick 
your favorite,’ the reviewer says. Gao did not 
reply to Science’s specific request to discuss 
the metagenomic data. 


IN AN ATTEMPT to reboot the origin probe, 
WHO last year created the Scientific Advi- 
sory Group for the Origins of Novel Patho- 
gens (SAGO), a new international team 
of 26 people. SAGO’s preliminary report, 
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Health officials 

in Taiyuan 

swab fish at a 
market in January 
2021. Chinese 
researchers 

say imported frozen 
food has caused 
COVID-19 outbreaks 
in the country. 


A bat is sampled for 
viruses in Thailand. 
China insists 
SARS-CoV-2 likely 
originated outside 
its borders and 

has advocated for 
more global 

efforts to find the 
virus’ origin. 


issued on 15 May, offers recommendations 
echoing those of the earlier team, with two 
major differences: It calls for more stud- 
ies of a potential “laboratory incident’— 
although members from China, Russia, and 
Brazil dissented—and it recognizes that the 
Huanan market did sell mammals suscep- 
tible to the virus. 

But by then, “anywhere but here” had be- 
come the Chinese government’s party line. 
“More and more clues from the interna- 
tional science community are pointing the 
origins of SARS-CoV-2 to sources around 
the world,” Zhao, the foreign press ministry 
spokesperson, asserted at a 10 June press 
conference in response to the SAGO report. 
He again brought up the controversial re- 
ports of very early cases outside China and 
called for investigations at the “highly suspi- 
cious” Fort Detrick laboratory. 

Wu says many Chinese researchers re- 
main open to the idea that the virus origi- 
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nated in China. “We have never ruled out 
Wuhan or China in the next phase of the 
investigation, and Chinese researchers are 
still working on it and continuously getting 
data,” he says. He stresses that, in response 
to peer reviewers, his team has “toned 
down” some of its bold assertions in the Re- 
search Square preprint about the bat survey 
in China. For example, instead of calling vi- 
ruses related to SARS-CoV-2 in Chinese bats 
“extremely rare,’ the manuscript now says 
they “might be rare.” 

But the clampdown on bat research that 
Hughes experienced has continued, espe- 
cially at the local level, says Aaron Irving, a 
bat researcher at Zhejiang University. “Peo- 
ple are now often scared to help out bat re- 
searchers in case they also get in trouble,” he 
says. Irving stresses that his institution still 
supports his work, but what he calls “non- 
scientific interference” has taken a toll. “Ev- 
eryone in the bat field got a little secretive 


More and 
more clues ... 
are pointing the 

origins of 
SARS-CoV-2 to 
sources around 

the world. 


Zhao Lijian, 
Chinese foreign ministry 


and protective too because they don’t want to 
be targeted.” 

Some researchers think too much time has 
passed to solve the origin mystery. But oth- 
ers who favor a natural origin say they only 
need a few more jigsaw pieces to complete 
the puzzle. Newman suspects Chinese offi- 
cials and researchers may already hold some 
of those pieces. “They should be in a position 
to know an awful lot more than we are cur- 
rently told,” he says. 

There is no shame in admitting that the 
virus came from wild animals illegally sold 
at a market, Newman adds. “To leave it as 
an open question, it’s just going to breed in- 
trigue about why the Chinese have not given 
us a clear explanation,’ Newman says. “Why 
the smoke and mirrors? Whereas if they 
come up with something that we could all 
just accept, well, then it will be case closed. 
Can’t we persuade them that this is the right 
thing for everybody to do?” 
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FOOD lobally, consumers have increasingly 
been getting the meals they consume 


Rese a rc h a ial d ol [ C fo i th e delivered by third parties such as 
p 1] Doordash, Grubhub, Wolt, or Uber 
= 4 Eats. This trend is attributable to 
food = d @ [ ive i revo | u ti 0 n broader changes in food systems and 
y technological and institutional innovation 


(such as apps and digital platforms and the 


Implications for nutrition, environment, and work earsaced (inne on kd perieedan toad 
may be considerable delivery) and has sharply accelerated as a 


consequence of the lockdowns resulting 
By Eva-Marie Meemken’, Marc F. Bellemare?, Thomas Reardon*, Carolina M. Vargas? from the COVID-19 pandemic (J, 2). Global 
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revenues for the online food delivery sector 
were about $90 billion in 2018, rose to $294 
billion in 2021, and are expected to exceed 
$466 billion by 2026 (3). The consequences 
and policy implications of this “delivery 
revolution” remain poorly understood but 
deserve greater attention. We offer an over- 
view of the drivers of the revolution and 
discuss implications for the environment, 
nutrition, and decent work, as well as re- 
cently implemented and potential policy 
options to address those consequences. 
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Adriver for food delivery service Uber Eats 
arrives to pick up food in Berlin’s Friedrichshain 
district, September 2021. 


DRIVING THE REVOLUTION 

There has been a steady rise in all regions 
of the globe in the share of sales of food- 
service revenue from home delivery, from 
2 to 7% in 2009 to 6 to 16% in 2019, and 
then a steep spike in 2021 to 13 to 27% (see 
the figure). All regions showed a similar 
growth pattern, and all remained within a 
tight band, except Latin American countries 
(LACs), which stayed well above the oth- 
ers. This delivery revolution is part of and 
driven by broader transformations of the 
entire food system (J, 4). We highlight three 
global trends in particular that are impor- 
tant: (i) the nutrition transition, (ii) growth 
and transformation in the food-service sec- 
tor, and (iii) the spread of technological and 
institutional innovations. 

Global diets have been changing as a re- 
sult of economic growth, urbanization, in- 
creasing female labor market participation, 
and growing international trade of food and 
other commodities, among other factors (J, 
4, 5). The nutrition transition refers to these 
changing dietary patterns, including rising 
demand for (i) higher-value products (such 
as fruits; vegetables; animal products; and 
products that meet high food safety, sus- 
tainability, or nutritional standards); (ii) 
processed foods that often contain high 
levels of sugar, salt, and fat and are often 
convenient and time-saving; and relevant to 
our context, (iii) food consumed away from 
home (such as in restaurants) and food pre- 
pared away from home (such as take-away 
food or food delivery) (1, 4, 5). The pan- 
demic has—through lockdowns and quar- 
antine orders—shifted much of the demand 
for food consumed away from home to food 
prepared away from home. With lockdowns 
and other COVID-19 measures being lifted 
in many countries, this trend is reversing 
somewhat, but we have good reason to be- 
lieve that newly acquired food habits are 
here to stay (6). 

Growth of the restaurant and broader 
food-service sector is a second trend that 
paved the way for the food delivery revo- 
lution. Restaurants and other food-service 
establishments have been spreading, first 
in high-income countries (HICs) from the 
1960s on (6) and then in low- to middle- 
income countries (LMICs) over the past 
several decades, given increasing demand 
and reduced transaction costs by way of the 
agglomeration economies associated with 
greater urbanization. The restaurant sector 
emerged, first as informal-sector small and 
medium enterprises that eventually joined 
the formal sector, and then into large enter- 


prises such as fast-food chains. In LMICs, 
this has been driven by domestic invest- 
ment and by the liberalization of foreign 
direct investment inducing an influx of 
foreign chains (5). Historically, restaurants 
have often offered takeout or delivery ser- 
vices (6), but only with increasing competi- 
tion has food delivery evolved as a profit- 
able—and thus common —service offered 
by various types of restaurants. 

Third, the spread of institutional and 
technological innovations has facilitated 
the delivery revolution. Delivery and take- 
out are as old as the restaurant as an eco- 
nomic institution. Delivery services were 
initially mainly offered by specialized 
lunch providers for lower-income workers 
(such as tiffin lunch boxes in India) and 
by restaurants serving easy-to-transport 
foods (such as pizza). Although food deliv- 
ery was already widespread before the turn 
of the century as offered by restaurants 
themselves, the spread of information and 
communication technologies (ICT) was a 
key driver of the delivery revolution. The 
proliferation of landlines made remote or- 
dering an accessible option for the masses. 
Remote ordering alters who travels (con- 
sumers versus producers) to bring food to 
homes, offices, or other locations and thus 
serves consumers eager to save time. The 
spread of ICT then made it more conve- 
nient to order remotely as well as to com- 
pare and discover different options (2). 

The proliferation of ICT had another 
important implication: It led to restau- 
rants outsourcing their delivery activi- 
ties to new, third-party specialized actors. 
Providers such as Doordash and Grubhub 
act as middlemen, linking large numbers 
of consumers and large numbers of food 
providers. The outsourcing of delivery and 
the rise of these third-party intermediates 
is an institutional innovation that allows 
smaller establishments, which might hith- 
erto not have had the capacity to offer de- 
livery themselves, to reach a broader con- 
sumer base. 

The “app-based” food-delivery sector has 
taken off over the past two decades. The 
COVID-19 pandemic accelerated the rise of 
delivery firms and the number of restau- 
rants that pivoted toward delivery in order 
to remain profitable (J). Such unantici- 
pated shocks often act as catalysts, spur- 
ring innovation and the adoption of new 
technologies. 
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IMPLICATIONS FOR POLICY AND RESEARCH 
The food-delivery revolution has important 
implications for environmental protection, 
healthy diets, and poverty reduction through 
employment generation and decent work. In 
what follows, we discuss the small but grow- 
ing body of scientific evidence in this area (2, 
7) and highlight research gaps that are rele- 
vant for policy. Because of the limited nature 
of the empirical evidence on this topic, our 
discussion also draws on theoretical consid- 
erations and public discussions. 


JOB CREATION AND DECENT WORK 
The food-delivery revolution has created 
new jobs, with understudied implications 
for migrant workers and their households 
and communities, especially in LMICs. 
Most new jobs in this growing sector in- 
volve unskilled labor (more jobs created 
for delivery workers than for managers and 
tech personnel). In HICs, these jobs are of- 
ten done by immigrants, including many 
undocumented migrants from LMICs (8), 
who send remittances home. During the 
COVID-19 pandemic, food-delivery jobs 
were also filled with local food-service 
employees who shifted from in-restaurant 
to delivery or warehouse work. In LMICs, 
those same urban jobs are often done by 
young rural migrants seeking opportunities 
outside of agriculture (9). Because fertility 
rates are high in many LMICs, the food de- 
livery and broader food-services sector help 
absorb the “youth bulge.” The economic 
effects of the food-delivery revolution on 
workers’ households and communities re- 
main poorly understood and are one direc- 
tion for future research. Similarly, a better 
understanding of who selects into delivery 
jobs in different regions and countries can 
also help better design and target policies 
aimed at improving working conditions. 
Delivery jobs are often precarious even 
in HICs, with poor safety standards and 
frequent traffic accidents, unpredictable 
schedules, long working hours, informal 
work arrangements, and low wages (8, 9). 
This is not only because delivery jobs are 
low skilled: Food delivery workers are also 
often freelance workers, like other gig- 
economy workers not directly hired but 
contracted with by the provider (8). The 
key concern is that the actual job meets 
many criteria of jobs that typically fall into 
the category of employment, and thus out- 
sourcing shifts responsibilities and risks to 
workers. At the same time, those workers 
are often not supported by labor unions 
because they are self employed. Thus, even 
in regions with stringent labor regulations 
and high levels of unionization (such as 
Scandinavia), delivery workers often face 
precarious working conditions. In the case 
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of LMICs, many if not most delivery pro- 
viders remain in the informal sector, where 
working conditions tend to be precarious. 
More research documenting working con- 
ditions among delivery workers in differ- 
ent regions and countries, and for different 
delivery firms would be useful and could 
focus on aspects such as payment and 
transportation modes, consumers’ tipping 
habits, recruitment strategies, and work- 
ers’ preferences and levels of satisfaction. 
There are several policy options to im- 
prove working conditions among delivery 
workers, and key challenges differ across 
regions. In HICs, any real improvement 
would require addressing regulatory gray 
areas, especially those related to the mode 
of employment (for example, self employ- 
ment and outsourcing of risks and costs 
by quasi-employers). In LMICs, monitoring 


companies to pay for the maintenance of 
their worker’s bicycles. Some private sec- 
tor-driven approaches have also emerged. 
In Switzerland, for example, some food de- 
livery providers have embraced public con- 
cerns around poor working conditions and 
recognized their consumers’ willingness to 
pay for better working conditions and have 
thus promised better employment condi- 
tions as part of their business and mar- 
keting model. Because all of these policies 
are new, their impact remains unclear and 
should be evaluated. 


NUTRITION 

The food-delivery revolution is both a cause 
and a consequence of the nutrition transi- 
tion, exacerbating challenges to promot- 
ing healthy diets and food environments 
to reduce overweight [body-mass index 


A global delivery revolution 


Revenue from home delivery is growing, shown here as a share of the total revenue of food service 
(full and limited service restaurants, bars and cafes, and street stalls and kiosks). See the supplementary 


materials for details on the data. 
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is a key challenge. Because many delivery 
providers are small enterprises and oper- 
ate in the informal sector, they are outside 
the reach of regulation and taxation. Even 
if new regulations are adopted, many LMIC 
governments are already strapped for re- 
sources to monitor and enforce regulations 
in fragmented industries in which there are 
many hundreds of thousands of small and 
informal providers. It would thus be a chal- 
lenge to enforce worker payment and work- 
condition regulations in these contexts. 
There are, however, several examples— 
mostly from HICs—of recent policies aimed 
at improving working conditions. For ex- 
ample, Australia is introducing regulations 
that require delivery companies to pro- 
vide safety equipment and training to their 
workers. Germany is now requiring delivery 


2015 


2016 2017 2018 2019 2020 2021 
(BMI) > 25], obesity (BMI > 30), and related 
noncommunicable diseases in HICs and in- 
creasingly in LMICs (4, 5). 

Most of the research so far focuses on 
away-from-home food consumption in 
HICs without necessarily differentiating 
between delivery and restaurants, although 
the digitalization of food environments is 
an emerging research topic (7). Restaurant 
meals—especially fast food—often contain 
higher levels of salt, sugar, and fat than 
those of home-cooked meals. The evidence 
also suggests that eating restaurant food 
instead of eating home-cooked meals can 
incentivize overeating and a higher calorie 
intake (6)—for example, from high alcohol 
consumption (JO) and selecting unhealthy 
meals (11). Yet in this comparison (restau- 
rant versus home-cooked meals), the in- 
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come level of consumers matters because 
home-cooked meals of wealthy consum- 
ers tend to be healthier than those of less 
wealthy consumers (11). 

An important question in this context is 
whether people eat more—or differently— 
when they get food delivered instead of eat- 
ing in restaurants. Here, more research will 
be needed (7). Generally, the proliferation 
of delivery has increased the availability of 
both healthy and unhealthy options. The 
evidence so far suggests that many foods 
available by means of online platforms are 
of poor nutritional quality (12). Thus, an 
important empirical question is whether 
improved and more convenient access to 
especially unhealthy meals through delivery 
increases consumption rates of unhealthy 
meals. Beyond accessibility, social context 
might also matter (for example, order- 
ing decisions can be influenced by peers). 
Whether delivery promotes overeating rela- 
tive to dining in is also an open question. 
Delivery might promote overeating rela- 
tive to restaurants given minimum order- 
ing thresholds on many delivery apps and 
the absence of social control, which may 
limit the unhealthfulness of what one or- 
ders when eating in a restaurant. However, 
delivery could reduce overeating because 
leftover food from delivery might be saved 
for later or shared because getting food de- 
livered reduces the near obligation, when 
dining in a restaurant, of ordering as many 
meal portions as there are diners. 

Thus, more research is needed to inform 
policy, especially in LMICs, where our un- 
derstanding of the food landscape is less 
complete than in HICs. Because unhealthy 
foods and ingredients are a key problem in 
the context of both restaurants and delivery, 
taxes on certain products (such as sugar- 
sweetened beverages) could be imposed, as 
has already been done in many countries 
(5). However, broader policies that address 
the underlying causes of poor food choices 
(for example, through health education, 
school feeding programs, social protection, 
safety nets, and pro-poor growth) are also 
likely needed. 


ENVIRONMENT AND CLIMATE CHANGE 
Some of the environmental challenges 
facing the food delivery sector include 
single-use packaging, food waste, and trans- 
portation and energy consumption (such as 
for cooking), which cause air and water pol- 
lution as well as CO, emissions (13). 

It is likely that delivery causes more pack- 
aging waste than does home preparation, 
and also somewhat more than restaurants, 
because more packaging is used for food 
delivery, takeout, and fast food (such as 
trays, cups, utensils, and napkins). The use 
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of single-use packaging has soared recently 
with the rise of food delivery. This causes 
solid waste pollution, especially in LMICs, 
where solid waste and recycling systems are 
often of poor quality (14). 

Several policy options are available to 
address these problems, but more research 
is needed to understand which approaches 
are most effective. In LMICs, investments 
in recycling systems and environmental 
education are required. In both LMICs 
and HICs, reducing the use of plastic items 
should be a policy goal, and several coun- 
tries have already banned certain single-use 
items (such as plastic straws). Alternatively, 
firms using these items could be taxed. 
Policy-makers could also subsidize the devel- 
opment and use of reusable or compostable 
packaging, promoting a growing trend. An 
increasing share of consumers wants envi- 
ronmentally friendly restaurant and deliv- 
ery practices. Over time, this trend will lead 
to more firms—especially those targeting 
higher-income, environmentally conscious 
consumers—to use compostable or reusable 
packaging (/4, 15). 

Whether food prepared away from 
home causes more food waste is an em- 
pirical question. It is also an open question 
whether where an individual eats affects 
how much food waste is generated; con- 
sumers might order more food when that 
food is delivered given that delivery fees 
are higher per unit when small quantities 
are ordered. When there is poor coordina- 
tion between restaurants and food deliv- 
ery, Meals might not get picked up in time 
and thus go to waste. Policies to encourage 
food waste reduction include education and 
regulation. Some European countries, for 
example, require supermarkets and baker- 
ies to donate food that is not sold in time. 
During the pandemic, some food-delivery 
firms partnered with community organiza- 
tions to donate leftover meals to the poor. 
Some restaurants also offer leftover meals 
late at night at very reduced prices through 
online platforms. Policy-makers could sup- 
port such initiatives. 

Transportation-related carbon dioxide 
(CO,) emissions occur irrespective of where 
meals are consumed. Ingredients need to 
be transported to the place where the food 
is prepared. Then, meals and consumers 
need to be in the same place, whether that 
place is a restaurant or consumers’ homes. 
When consumers do not drive themselves 
(when they order food delivered), average 
distances that meals travel might increase. 
Large-scale food delivery and _ prepara- 
tion, however, likely exploit economies of 
scale in various areas relevant to resource 
use and environmental protection. For ex- 
ample, delivery workers, unlike consumers, 


plan routes to deliver more than one order. 
Similarly, buying larger quantities of ingre- 
dients at once also reduces transport emis- 
sions per unit of food, and cooking larger 
quantities might also reduce resource use 
(such as water and energy) per unit of food. 

We are unaware of research on this topic, 
but given the many different scenarios and 
means of transportation (such as bicycle, 
motorcycle, or car), cooking, and other 
forms of energy use (such as refrigeration), 
it seems that making generalizable state- 
ments about the relationship between CO, 
emissions and delivery is going to be diffi- 
cult. For transportation, one’s best hope to 
do so might be to look at the progressive 
spread of adoption over time and compare 
it with air quality over the same time period 
for several metropolitan areas. That said, 
in many European cities, delivery workers 
use bicycles because they are faster and 
cheaper. The most effective way to reduce 
transport-related emissions might be to 
make cities more bicycle-friendly. 

As of writing this paper, several countries 
have implemented new policies related to 
the food delivery sector. Such efforts may 
help advance progress toward the United 
Nations’ Sustainable Development Goals 
(SDGs), especially those focusing on re- 
ducing all forms of malnutrition (SDG 2) 
and poverty, ensuring decent work, and 
spurring economic growth (SDGs 1 and 8), 
and sustainable farming and consumption 
(SDG 12). More research will nonetheless 
be needed to evaluate various policies re- 
lated to the delivery sector, both ex ante 
and ex post. 
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METASTABLE MATERIALS 


Making the most of metastability 


Researchers seek to preserve materials that are 


formed at high pressure 


By Wendy L. Mao! and Yu Lin? 


xtreme environments are useful for 

studying previously uncharacterized 

phases and properties of materials 

(1). Researchers may discover mate- 

rials with desirable features, which 

would be especially attractive if they 
can be retained under ambient conditions. 
Although the thermodynamically stable 
state of a material is generally determined 
by its lowest energy configuration, there is 
also the possibility of a metastable state at 
local energy minima—a “valley” on a curve 
that is not the overall lowest point. A lo- 
cal energy minimum allows phases to exist 
outside their stability field for an extended 
period of time, making metastability a rich 
subject for exploring material properties. On 
page 870 of this issue, Xiao et al. (2) report 
using nanocrystals to explore the processes 
that influence the height of the kinetic en- 
ergy barrier between different structures to 
preserve, at ambient conditions, phases that 
occur at high pressure. 

Materials with the same composition can 
have substantially different properties based 
on their structure, making them desirable for 
different applications. A well-known example 
of a metastable material is diamond, which is 
stable at high pressure and temperature but 
whose metastability field extends far beyond 
equilibrium and back to ambient conditions. 
Knowledge about this high-pressure phase of 
carbon has guided the development of tech- 
niques to mimic the high-pressure and tem- 
perature conditions for creating synthetic 
diamonds. An understanding of diamond’s 
carbon bonding has inspired an alternative 
synthesis pathway for creating diamonds, 
by using the chemical vapor deposition of 
carbon in a methane plasma under near- 
vacuum conditions. This process allows the 
generation of the metastable diamond phase 
without requiring high pressure. 

Although metastability can sometimes 
offer alternative ways to access the compet- 
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ing structural phases, much of the research 
remains limited to certain materials and 
is not generally applicable for materials 
design. A key consideration shared by all 
metastable phases is the kinetic energy bar- 
rier, which can prevent the material from 
transforming back to the thermodynami- 
cally stable phase. Xiao et al. studied the 
rock-salt structure phase—the same struc- 
ture found in sodium chloride (NaCl)—in 
cadmium sulfide (CdS) and cadmium sel- 
enide (CdSe) semiconducting nanocrystals. 


Preserving high-pressure 


phases in nanocrystals 
Model nanocrystal systems help determine how 
ambient conditions can support phases observed at 
high pressure. 
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Previous theoretical predictions and experi- 
mental observations indicated that defects 
and strain can reduce the intrinsic energy 
barrier and the metastability of this phase 
under ambient conditions. 

Xiao et al. explored numerous factors that 
can be tuned in their model nanocrystal 
systems. The factors include different com- 
positions, starting atomic arrangements, 
nanoparticle shapes, and the addition of li- 
gands bound at the nanocrystal surface. The 
authors observed the degree of long-range 
ordering in the nanocrystal assemblies to 
see how these factors influence defects and 
strain in the crystals (see the figure). They 
also investigated the effects of pressure, 
such as the highest pressure applied to the 
system and the rate of the pressure release, 
in preserving the high-pressure metastable 
phase under ambient conditions. Xiao et al. 
found that there is a noticeable interplay be- 
tween these parameters and identified the 
formation of interconnected nanocrystal 
networks with reduced strain and defects 
as a key mechanism for maximizing barrier 
height and metastability. The interconnected 
nanocrystals help preserve the metastable 
rock-salt phase at ambient conditions. This 
revelation provides insight into developing 
generalized strategies for engineering trans- 
formation barriers that retain metastable 
phases in these and similar semiconductors 
and transition metal chalcogenides. 

Another recent effort to control meta- 
stability involves designing pressure- 
temperature synthetic pathways and direct- 
ing subtle structural differences in a halide 
perovskite system (3). Halide perovskites 
are a class of versatile, crystalline semicon- 
ductors that have shown great promise in 
a range of optoelectronic applications, such 
as photovoltaics (4) and solid-state light- 
ing (5). By understanding the structural 
properties that determine the energy re- 
quirements between competing metastable 
structures, a narrow set of conditions were 
identified for accessing a useful, metastable 
halide perovskite phase. The application of 
pressure drives delicate atomic rearrange- 
ments and minimizes the energy differ- 
ence between the metastable phase and the 
thermodynamically stable one. Subsequent 
rapid quenching from high temperature 
preserves the desired metastable struc- 
ture at ambient conditions. This presents a 
promising strategy to control the structure 
of a material by tuning the thermodynamic 
and kinetic conditions, preserving certain 
desirable metastable phases. 

A challenge when considering the poten- 
tial for metastable materials is that many 
high-pressure transitions have small kinetic 
energy barriers, which results in the high- 
pressure phase transforming back to the 
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low-pressure forms. In addition, metastable 
phases may also lose their outstanding high- 
pressure properties when the conditions are 
removed. For example, elemental hydrogen 
and some hydrogen-rich compounds are 
high-temperature superconductors at ex- 
tremely high pressure, but they do not retain 
their superconductivity after the pressure 
is released (6). To preserve desirable phases 
and properties under ambient conditions, 
one may learn from the geological diamond- 
inclusion process, where a piece of some- 
thing else is trapped, that is, included, inside 
a diamond as both form deep underground. 
As a result, the diamond can maintain the 
high-pressure conditions for the embedded 
compound when exposed to ambient atmo- 
spheric pressure (7). This process enables the 
preservation of high-pressure phases that 
are not metastable at ambient conditions 
and the potential for retaining favorable 
high-pressure properties. It may be mim- 
icked by using carbon precursors to form 
capsules around a sample of interest and 
then converting the carbon precursor into 
diamond at high pressure and temperature. 
Once formed, the nanocrystalline diamond 
capsule can maintain high-pressure condi- 
tions in the sample even after the external 
pressure is released (8). 

There is a need to investigate general ap- 
proaches for leveraging metastability for ac- 
cessing a broader range of material systems 
under practical conditions using both ex- 
periments and computational tools (9, 10). 
Extreme environments are a playground 
for discovering phases with enhanced ma- 
terials functionality and their transition 
mechanisms and for exploring pathways to 
recover these phases under ambient con- 
ditions. The goal is to learn enough about 
these systems to rationally design materi- 
als with exceptional properties that can be 
preserved and ultimately synthesized under 
practical conditions. 
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By Chengyi Hou! and Meifang Zhu? 


hermoelectric materials can convert 

a temperature gradient into electric- 

ity or convert electricity into a tem- 

perature gradient. They can be used to 

harvest waste heat and help to reduce 

the carbon footprint of many applica- 
tions, ranging from computers to industrial 
equipment. Because the heat sources usually 
have a large area, planar thermoelectric gen- 
erators with series-connected thermoelec- 
tric materials can satisfy the requirements. 
However, these materials—mostly inorganic 
thermoelectric semiconductors—are nor- 
mally rigid and brittle, which restricts their 
application. On page 854 of this issue, Yang 
et al. (1) report a room-temperature ductile 
inorganic thermoelectric semiconductor that 
is flexible. This material opens the door to 
new applications such as wearable devices 
that can be powered by converting body heat 
to electricity. 

Heat is continuously generated by the hu- 
man body, of which about 40 mW/cm? is dis- 
sipated through the skin (2). Thermoelectric 
generators can convert this heat into elec- 
tricity by using the temperature difference 
between the human body and the ambi- 
ent environment. To realize the application 
for powering wearable devices, researchers 
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‘semiconductors can 


Yang et al. created a ductile inorganic thermoelectric 
semiconductor that can be used to generate power for 
wearable devices by using body heat. Shown here is a 
wristband-like thermoelectric generator that integrates 
series-connected superthin ductile thermoelectric 
semiconductors on flexible polymer substrates. 


have sought flexibility by using conductive 
polymers and organic-inorganic hybrids. 
However, the poor electrical transport of 
known organic-based thermoelectric materi- 
als leads to low thermoelectric efficiency, as 
measured by a “figure of merit” denoted as 
“zT’ and calculated by using the electrical 
conductivity, the thermal conductivity, and 
Seebeck coefficient (3). 

To meet the demand for high-performance 
flexible thermoelectrics, Yang et al. devel- 
oped a series of room-temperature ductile 
inorganic semiconductors with a high g7. In 
contrast to other inorganic semiconductors 
and ceramics that are usually brittle, these 
semiconductors can change shape instead of 
breaking—tolerating up to 18% strain—and 
can be machined to produce thin films with 
a thickness in the 10° m range. Being ductile 
and made thin enough, these inorganic ther- 
moelectric materials can be easily bent (4), 
thus facilitating the fabrication of high-per- 
formance flexible thermoelectric generators. 

The first room-temperature ductile inor- 
ganic semiconductor was discovered in 2018 
(5). Since then, researchers have worked 
on many ductile thermoelectric materials, 
such as indium selenide (InSe) and silver 
sulfide (Ag,S) (5-9). Most of these materi- 
als and their derivatives are n-type, where 
the main carriers are electrons (5-8), with a 
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maximum 27 value of around 0.44 at room 
temperature. The only known p-type ductile 
inorganic semiconductor, where the main 
carriers are electron holes instead of elec- 
trons, is a mixture of Ag, copper (Cu), Se, 
and S known as AgCu(Se,S) solid solution. 
However, the material has a poor zT value 
(0.01) at room temperature (9). Generally, an 
optimal thermoelectric generator would con- 
tain alternating grains of p- and n-type ther- 
moelectric materials known as p-n couples; 
thus, being the weaker link in p-n couples, 
good p-type ductile thermoelectric materials 
are urgently pursued. 

Yang et al. added tellurium (Te) to AgCuSe 
to convert its electrical conduction from n- 
type to p-type. This helped to increase the 
concentration of cation vacancy, but the re- 
sulting AgCu(Se,Te) solid solution is just as 
brittle as AgCuSe. The authors found that by 
mixing in a tiny amount of S in AgCu(Se,Te), 
a brittle-to-ductile transition occurs in the re- 
sulting AgCu(Se,S,Te). The increased amount 
of strongly polar Ag-S bonds increases the 
energy required for crystal plane cleaving, so 
these planes can hold the material together 
under an outer force instead of forming 
cracks during slipping. However, the Ag-S 
bonds also suppress the cation deficiency in 
the solid solution. Therefore, the amount of 
new Ag-S bonds also need to be balanced so 
as not to decrease the zT value too much. 

After optimization, Yang et al. observed 
a p-type ductile thermoelectric semicon- 
ductor with a gT of 0.45 at room tempera- 
ture. This leads to a promising possibility 
for realizing practical p-n thermoelectric 
couples, of which the thickness also deter- 
mines the material’s electrical resistance in 
addition to the bendability. The intrinsic 
brittleness of classic inorganic thermoelec- 
tric semiconductors, such as Bi,Te, (10), 
prevents the fabrication of a thin and flex- 
ible thermoelectric generator. By contrast, 
the flexible device fabricated by Yang et al., 
which contains six superthin p-n thermo- 
electric couples connecting in series, has 
an internal resistance of only 18 milliohm, 
which is more than 100 times less than that 
of bulk Bi,Te,-based thermoelectric genera- 
tors. This eventually resulted in a record- 
breaking maximum normalized power 
density of 30 ubW cm? K®, which is four 
times greater than that of the Bi,Te,-based 
thermoelectric generators and 10,000 
times greater than that of the organic- 
based flexible thermoelectric generators. 
The improved output performance can 
help enable self-powered wearable elec- 
tronics such as a smart wristwatch with 
an LED display, a heart rate monitor, and 
other functional units that are all powered 
by a flexible thermoelectric wristband. 

From an application viewpoint, additional 
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attention should be paid to the flexible ther- 
moelectric generator’s performance stability. 
Yang et al. demonstrated a 5% internal resis- 
tance increment in the thermoelectric gen- 
erator after bending and unbending it 500 
times around a radius of 15 mm. Whether the 
ductile feature or other unexplored mecha- 
nisms will support good elastic stability un- 
der cyclic strain remains an open question. 
Because the thermoelectric output power 
increases with increasing temperature differ- 
ence, it is also pertinent to maximize the tem- 
perature difference across the superthin ther- 
moelectric generator from the skin-facing 
side to the other side. Although experimental 
results suggest that thermal convection and 
diffusion on the exposed thin film surface 
may enhance temperature difference to 1 to 
2 K, it is still not competitive to the value 
measured in many millimeter-thick rigid 
thermoelectric generators, which ranges up 
to 20 K. Hopefully, passive radiative cooling 
and/or the combined thermoregulation ap- 
proaches (77) will be useful for this purpose. 
Continuous efforts from interdisciplinary col- 
laborations are thus required. 

It is also important to consider the wear- 
ing comfortability of the material. Wearable 
electronics need to have a certain level of 
air and moisture permeability, softness, and 
lightness (72). Although thermoelectric fibers 
have been developed and integrated into tex- 
tiles for comfortable wearable body heat har- 
vesting (13), most of them are made of poly- 
mers and have relatively poor thermoelectric 
properties. The good machinability of ductile 
inorganic semiconductors may enable the 
fabrication of high-zT thermoelectric fibers, 
which potentially are set to hold advantages 
over existing thermoelectric fibers. This re- 
search will help to guide future studies on 
flexible heat harvesting and sensing applica- 
tions and is beneficial to the growth of por- 
table, flexible, and wearable electronics. 
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Taking the “F” 
out of forever 
chemicals 


The right solvent mix 
breaks down perfluorinated 
organic acids 


By Shira Joudan! and Rylan J. Lundgren? 


er- and polyfluoroalkyl substances 

(PFAS)—sometimes called “forever 

chemicals” because of their chemical 

stability—are synthetic molecules 

widely used in consumer products 

(1). Ironically, the characteristics that 
make these compounds useful, such as lipo- 
and hydrophobicity, high thermal stability, 
and resistance to chemical degradation, also 
lead to pervasive contamination in the en- 
vironment (2). Moreover, chronic exposure 
to these chemicals has been associated with 
a host of negative human health effects (3). 
Unfortunately, the carbon-fluorine bonds in 
PFAS are among the most inert in organic 
compounds, which means that their destruc- 
tion requires brutal treatment, such as incin- 
eration at high temperatures. On page 839 
of this issue, Trang et al. (4) show that under 
specific mild conditions, perfluoroalkyl car- 
boxylic acids (PFCAs), which are a type of 
PFAS, undergo spontaneous decomposition 
into benign inorganic fluoride ions and sim- 
ple oxygenated organic molecules. 

Ideally, all anthropogenic chemicals would 
easily break down to innocuous compon- 
ents once released into the environment. 
However, many of the approximately 5000 
different PFAS are persistent. Several PFAS 
do go through transformations in the en- 
vironment and become PFCAs (5), although 
once they have reached this stage, they re- 
main a persistent contaminant. 

PFAS can be destroyed by incineration 
(6), electro- or photochemical oxidation, or 
reductive defluorination. However, these 
methods require substantial energy input 
and are usually expensive to operate. Trang 
et al. discovered a low-energy pathway for 
decomposing PFCAs in which decarboxyla- 
tion of the compound’s acid group turns it 
into carbon dioxide (CO,) and a perfluorin- 
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ated carbanion (a trivalent carbon anion). 
The carbanion can then undergo a series 
of chemical reactions to eventually break 
all the carbon-fluorine (C-F) bonds in the 
parent compound, producing inorganic 
fluoride and a combination of CO,, formate 
(CHO,), and other oxygenated small mol- 
ecules that no longer contain C-F units. The 
reaction occurs under mild temperatures 
(80° to 120°C). Outside of the reaction 
media, only the addition of sodium hydrox- 
ide (NaOH) is needed for the reaction to 
occur, making it potentially much cheaper 
than the processes currently used. 

In addition to providing a simple approach 
to PFAS destruction, Trang et al. explored 
mechanisms of the transformation by look- 
ing at reaction mixtures, kinetic studies, and 
computational analysis. The key elementary 
chemical step that leads to a cascade of C-F 
bond breaking is decarboxylation of the acid 
group in the molecule. Decarboxylation of 
a related class of chemicals, perfluoroalkyl 
ether carboxylic acids, was previously ob- 
served as an inconvenient stability issue (7 
8). The key to the observed decarboxylation 
by Trang et al. is the addition of the polar 
aprotic solvent dimethyl sulfoxide (DMSO) 
(see the figure). In solvents such as water, 
PFCA is extremely stable against degrada- 
tion, but in DMSO, the reaction proceeds. 
This simple difference explains how a mol- 
ecule so persistent in the environment can 
degrade so readily in the lab. 

Past studies have shown the importance 
of selecting the right solvent to enable 
low-temperature uncatalyzed decarboxyl- 
ation reactions (9, 10). It is necessary to 
have reaction media that stabilize charge 
build-up through dipole-dipole interactions 
without overly stabilizing the carboxylic 
acid itself through hydrogen bonding (JZ). 
With a large dipole moment, the DMSO 
identified by Trang et al. is well-suited to sta- 
bilize charged reactive intermediates with- 
out forming strong hydrogen bonds with the 
carboxylate substrate. In quantified terms, 
the decarboxylation step was estimated by 
computational analysis to be the rate-deter- 
mining step, with an activation barrier of 28 
kcal/mol. This was in close agreement with 
the experimental value of 30 kcal/mol. 

In Trang et al.’s experiment, after the per- 
fluoroalkyl anion is generated, the break- 
down of the fluorocarbon is energetically 
favorable and fast because the subsequent 
reaction steps have relatively low kinetic 
barriers. The perfluoroalkyl anion eliminates 
a fluoride ion to give a fluorinated alkene. 
The strongly electron-withdrawing C-F 
bonds make the fluorinated alkene unit elec- 
tron-poor and thus highly reactive toward 
the addition of hydroxide. This process pro- 
ceeds rapidly and leads to the expulsion of 
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another fluoride ion. This step produces an 
unstable acyl fluoride intermediate, which 
then undergoes several reactions to shed 
every fluorine atom from the organic frame- 
work, resulting in a mixture of innocuous 
oxygenated organic molecules. 

The proposed mechanisms for PFCA 
degradation help to explain the optimized 
reaction conditions. DMSO is the solvent 
needed for decarboxylation, but aqueous 
sodium NaOH is also needed to promote 
defluorination of the perfluoroanion to com- 
plete the reaction. In the end, the solution 
used by Trang et al. was 1 part water to 8 parts 
DMSO-—and 30 equivalents of NaOH per 


Breaking down forever chemicals 
Per- and polyfluoroalky! substances are contaminants 
typically resistant to degradation, leading to their 
accumulation in the environment. Trang et al. 
designed a chain of reactions that can decompose 
perfluoroalkyl carboxylic acids (PFCAs)—a kind 

of such chemicals—into relatively harmless fluoride 
ions and oxygenated organic molecules. 


@ Fluorine Carbon @ Oxygen OHydrogen 


Perfluorooctanoic acid is 
a typical PFCA that is 
difficult to break down. 


Under the right conditions 
using a dimethyl! 

sulfoxide (DMSO)/H,0 
and NaOH mixture, 

the molecule loses CO, 
and is converted to a 
reactive perfluoroanion. 


The molecule loses a 
fluoride ion because it is 
unstable, and becomes 

a perfluoroalkene, 

which can be readily 
converted to a series of 
nonfluorinated molecules 
and trifluoroacetate 

by acascade of reactions 


in basic water/DMSO. 
Fluoride Carbonate Glycolate 
(F>) (CO,*) (C,H,0;) 


oy Af 


Trifluoroacetate 
(C,F,0,-) 


Oxalate 
(C,0,7) 


Formate 


molecule of perfluorooctanoic acid (PFOA), 
which comes out to roughly the same amount 
of NaOH as the water by weight. 

In addition to the perfluorooctanoic acid 
used in their initial experiment, Trang et al. 
examined other PFCAs with varying carbon 
chain lengths, which also undergo similar 
degradation under the described experi- 
mental condition. In general, >80% fluoride 
recovery is observed for acids with a carbon 
chain between four and nine carbons long. 
In cases of one- or two-carbon perfluorocar- 
boxylic acids, decarboxylation is slower be- 
cause less-stable perfluoroalkyl anion inter- 
mediates are produced. The mechanism of 
degradation for these PFAS contrasts with 
those described in previous studies, which 
postulate sequential one-carbon removals 
along the chain to generate increasingly 
shorter molecules (72). Trang et al. demon- 
strate an approach to PFAS degradation that 
also works for branched perfluoroalkyl ether 
carboxylic acids, such as the hexafluoro- 
propylene oxide dimer acid. Often known 
by its trade name “GenX,” the chemical has 
emerged as a replacement for certain PFAS 
and is now a ubiquitous environmental con- 
taminant itself (13). Computational analysis 
by Trang et al. suggests a similar pathway for 
decomposition for GenX compared to PFOA, 
with initial substrate decarboxylation occur- 
ring at 40°C. 

Given that nearly 5000 different PFAS are 
in use, it is imperative to understand the 
fundamental reactivity of these molecules. 
Especially important is the fact that most 
PFAS eventually form PFCAs, which remain 
in the environment practically forever (74). 
Trang et al. provide insight into how these 
seemingly robust compounds can undergo 
nearly complete decomposition under un- 
expectedly mild conditions. Hopefully, the 
fundamental findings of Trang et al. can 
be coupled with efficient capture of PFAS 
from contaminated environmental sites to 
provide a possible solution to the forever 
chemical problem. 
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NEURODEGENERATION 


Toward preventing Parkinson’s disease 


A glycoprotein is a possible biomarker of pathogenic progression for Parkinson's disease 


By Brit Mollenhauer! and 
Christine A. F. von Arnim? 


he diagnosis of Parkinson’s disease 

(PD) relies on the presence of motor 
symptoms that occur when 50 to 70% 

of dopaminergic neurons in the sub- 
stantia nigra degenerate. However, 

this happens after at least 20 years 

of the prodromal (premotor) phase. The 
spreading of a-synuclein (aSyn) aggregates, 
called fibrils, is pathologically associated 
with PD (1). However, the mechanisms 
leading to neurodegeneration continue to 
be debated. Alterations in the gene encod- 
ing aSyn are associated with increased risk 
of PD. Similarly, single-nucleotide polymor- 
phisms (SNPs) that drive expression of gly- 
coprotein nonmetastatic melanoma protein 
B (GPNMB) are associated with increased 
risk of PD (2, 3). On page 833 of this issue, 
Diaz-Ortiz et al. (4) report confirmation that 
SNPs increase GPNMB expression and con- 
fer PD risk. GPNMB was increased in the 
blood of PD patients and associated with 
disease severity, suggesting that GPNMB 
could be a biomarker of PD progression. 
GPNMB is a transmembrane 
glycoprotein, which was first de- 
scribed in 1995 as more highly 
expressed in low-metastatic 
melanoma cell lines. The extra- 
cellular domain can be cleaved, 
and the soluble fragment binds 


ity and lysosomal dysfunction, which may 
lead to aSyn accumulation through the au- 
tophagic-lysosomal pathway (6). 

Increased concentrations of GPNMB have 
been described in postmortem brain samples 
from patients with PD as well as samples 
from patients with Alzheimer’s disease. 
Increased amounts of GPNMB in cerebro- 
spinal fluid (CSF) have also been suggested 
as a promising biomarker in other neuro- 
logical disorders, such as amyotrophic lateral 
sclerosis (ALS) (7). However, measurements 
of GPNMB concentrations in CSF could not 
distinguish between Alzheimer’s disease and 
other neurological diseases, but they cor- 
related with parameters such as aging and 
phosphorylated Tau in CSF (a marker of 
Alzheimer’s disease), which indicates that 
increased GPNMB in CSF is not specific (8). 

Neuroinflammation and activation of glia 
have been proposed to be possible mecha- 
nisms of neurodegeneration, suggesting a 
link between aSyn aggregation, neurode- 
generation, and neuroinflammation in PD. 
GPNMB appears to be associated with the 
suppression of inflammatory processes and 
has been demonstrated to reduce proinflam- 


Changes in Parkinson’s disease 
Increases of pathology and possibly neu 
before diagnosis of Parkinson's disease (PD) (year 0). People at risk of PD or disease 
progression could be identified with panels of biomarkers, including imaging with 


oinflammation can occur up to 30 years 


matory cytokine secretion in macrophages 
outside of the brain (9). Conversely, GPNMB 
has been proposed to play a role in neuro- 
inflammation and was found in microglia in 
the substantia nigra of PD patients by single- 
cell sequencing (10). 

To analyze GPNMB function in neurons, 
Diaz-Ortiz et al. depleted its expression in 
human induced pluripotent stem cell (iPSC)- 
derived neurons and observed a decrease 
in synaptosomal aSyn. Further analyses re- 
vealed that aSyn and GPNMB interact. aSyn 
was found to be a hub for protein-protein 
interactions of synapse-related proteins, 
which were affected by the loss of GPNMB. 
Moreover, loss of GPNMB in iPSC-derived 
neurons specifically prevented internaliza- 
tion of aSyn fibrils. This study shows that 
GPNMB promotes synaptic uptake and cel- 
lular pathology of aSyn. To date, GPNMB has 
been associated with neuroinflammation and 
microglia but has not been directly linked 
to neurons. It will be important to assess 
whether GPNMB is also directly involved in 
aSyn uptake by microglia. This could enable 
understanding of the overall and GPNMB- 
specific roles of inflammatory reactions in PD 
and whether they are a cause or 
consequence of neurodegenera- 
tion. Promoting loss of GPNMB 
could also be a therapeutic strat- 
egy for early PD. 

Neuroprotective strategies 
have largely failed to prevent 


many receptors, triggering an 
intracellular response. GPNMB 
has different functions in dif- 
ferent cell types, and its roles 
in metastasis and inflamma- 
tion are still a matter of debate 
(5). In 2016, GPNMB was found 
to be highly expressed in the 
brains of patients with Gaucher 
disease. This lysosomal storage 
disorder is caused by muta- 
tions in the glucocerebrosidase 
(GBA) gene, which is also an 
important risk factor for PD. 
Mutations in GBA may lead to 
loss of glucocerebrosidase activ- 
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neurodegeneration in PD for 
various reasons, including be- 
cause targeting the complex 
molecular mechanisms involved 
is challenging. There are diverse 
contributing genetic and envi- 
ronmental factors, and identi- 
fying patients undergoing the 
prodromal phase of the disease 
is difficult. Imaging dopamine 
transporter activity with single- 
photon emission computed to- 
mography (SPECT) is currently 
used in the differential diagnosis 
of PD and to confirm early PD. 
Therefore, there is an urgent 
need to not only better under- 
stand the pathophysiology of 
PD but also establish a panel of 
biomarkers that identifies indi- 
viduals at risk of developing PD, 
similar to emerging blood-based 
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biomarker panels for Alzheimer’s disease (11). 

Diaz-Ortiz et al. found that there were 
increased concentrations of GPNMB in 
the plasma of 781 early symptomatic and 
advanced PD patients compared with 59 
healthy controls and that the concentra- 
tions increase with disease stage, indicating 
that GPNMB could be a biomarker of dis- 
ease progression. Despite previous findings, 
the authors found no changes in GPNMB in 
CSF. Because GPNMB concentrations are in- 
creased in the CSF of ALS patients, further 
studies should assess GPNMB in CSF and 
blood from patients with other neurodegen- 
erative diseases, including dementia with 
Lewy bodies, multiple system atrophy, and 
progressive supranuclear palsy. The changes 
in plasma GPNMB with PD stage also need to 
be validated in longitudinal samples from PD 
patients, and prodromal participants should 
be optimally included to assess the suitability 
of plasma GPNMB as a biomarker. 

GPNMB seems not to work as a single pre- 
dictive marker, so panels of biomarkers will 
be needed. The recently developed seeding 
aggregation assays for aSyn in CSF (72) come 
close to an ideal marker for indicating PD, 
with sensitivity and specificity both above 
90% (13). Tau, phosphorylated Tau, and neu- 
rofilament light chain (14, 15), as well as mi- 
croRNAs, may also be added to the biomarker 
panel, perhaps along with GPNMB (see the 
figure). This could allow people with early 
disease, or perhaps even prodromal disease, 
to be identified, and it would be a useful ad- 
dition to allow monitoring in clinical trials. 
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IMMUNOLOGY 


Unveiling the B cell 
receptor structure 


Molecular structures provide a road map for understanding 
and controlling B cell receptor activation 


By Pavel Tolar! and Susan K. Pierce? 


cells took center stage in the CO- 

VID-19 pandemic as the world 

waited anxiously for a _ vaccine. 

Nearly all vaccines in use today de- 

pend on activating B cells by bind- 

ing to the surface-expressed B cell 
receptors (BCRs) that recognize antigens 
and stimulate antibody production. How- 
ever, even recent vaccines were developed 
without a detailed understanding of how 
BCRs function, and many fail to provide 
optimal long-lived immunity. On pages 875 
and 880 of this issue, Su et al. (1) and Ma 
et al. (2), respectively, reveal the structures 
of the BCR complex, which is composed of 
membrane forms of antibodies [immuno- 
globulin M (IgM), IgG, etc.] bound to two 
transmembrane proteins, Iga(CD79A) and 
Ig8 (CD79B). These structures are likely to 
inform the development of highly effective 
vaccines, therapies that eliminate cancers, 
and treatments that control and prevent au- 
toimmune diseases. 

Earlier studies showed that the mem- 
brane antibody of the BCR has the familiar 
Y shape, with two fragment antigen-binding 
(Fab) arms connected through hinges to 
one fragment crystallizable (Fc) leg, which 
in this case is linked to a transmembrane 
domain. Iga and IgB chaperone the mem- 
brane antibody to the B cell surface and, 
crucially, initiate biochemical signaling 
inside the B cell when the membrane anti- 
body binds antigen (3, 4). The critical inter- 
actions that hold the membrane antibody 
to Iga and Ig are buried within the cell 
membrane (5), making the BCR resistant to 
structural studies by crystallography. 

The BCR is a multifunctional machine 
that is responsible for regulating most func- 
tions of a B cell. Under the appropriate 
conditions, antigen binding triggers B cell 
proliferation and differentiation into anti- 
body-secreting cells. However, this activa- 
tion requires second signals, without which 
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BCR signaling leads to cell death. Many 
second signals are provided by CD4* helper 
T cells. To solicit T cell help, the BCR func- 
tions as an endocytic receptor that captures 
the antigen and delivers it to intracellular 
vesicles for proteolytic processing and load- 
ing onto major histocompatibility complex 
class II (MHC II) molecules, which are rec- 
ognized by antigen-specific helper T cells. 
T cells also stimulate class-switching from 
the default IgM and IgD antibodies to IgG, 
IgA, or IgE. All of these antibody classes 
form distinct BCRs on B cell surfaces and 
fine-tune the responses to antigens (6). For 
example, IgG BCRs stimulate antibody se- 
cretion better than IgM BCRs. 

Another important feature of the BCR is 
sensing the affinity of the antigen binding. 
Rather than being an on-off switch, the BCR 
grades the responses to advantage B cells 
that are better at recognizing the antigen at 
the expense of B cells that bind it poorly. This 
is the basis for antibody affinity maturation, 
which is mediated by the selection of B cells 
that have acquired somatic mutations in the 
antigen-binding site of their antibodies (7). 
The BCR is not idle in the absence of anti- 
gen but provides survival signals to resting 
B cells (8). Clearly, understanding all the nu- 
ances of B cell activation is not possible with- 
out a detailed structure of the BCR. 

The physical signal through which anti- 
gen binding is transmitted through the BCR 
to the interior of the cell has itself been an 
enigma. Originally, antigens were thought 
to simply aggregate BCRs on the B cell sur- 
face because antigens are overwhelmingly 
multivalent (e.g., virus particles). BCR ag- 
gregation was an attractive mechanism 
because it obviated the need for conforma- 
tional changes induced by antigen binding 
to be propagated to the cytoplasm through 
the highly flexible hinge region of the BCR 
antibody. Instead, the model proposed that 
aggregation of the Iga-Ig8 intracellular 
domains leads to cross-phosphorylation of 
their immunoreceptor tyrosine-based acti- 
vation motifs (ITAMs) by associated tyro- 
sine kinases (9). However, phosphorylation 
was observed to spread to BCRs that were 
not engaged by antigen, suggesting that the 
process can also work the opposite way: 
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Resting BCR oligomers can be separated 
by antigen binding, exposing their ITAM 
phosphorylation sites (J0). Furthermore, 
given that the BCR often needs to wrestle 
antigens from other cell surfaces, a me- 
chanical tug on the BCR could induce 
distant conformational changes or cluster- 
ing (11), likely in conjunction with regula- 
tion from the submembrane cytoskeleton, 
which helps to keep the BCR in the resting 
state (12). The new structures presented by 
Ma et al. and Su et al. are the first steps 
toward testing these models in B cell acti- 
vation or pathology. 

To provide a high-resolution view of the 
BCR, both groups purified the BCR from 
cells and imaged them using cryo-electron 
microscopy (cryo-EM). It took more than 6 
million single BCR particles to refine each 
of the final structures up to atomistic de- 
tail. Both papers present structures of the 
IgM BCR, and Ma et al. also report that of 
the IgGl BCR (see the figure). The struc- 
tures agree with each other and confirm 
in vivid detail what was previously only 
depicted in schematic cartoons. The mem- 
brane antibody retains its Y shape in which 
the Fab arms remain free, whereas the Fc 
portion binds the Iga-IgB heterodimer with 


1:1 stoichiometry (J0). Linker peptides in- 
troduce the extracellular portions into the 
cell membrane, where the transmembrane 
helices form a tight bundle, glued together 
by numerous polar residues interacting in- 
side the hydrophobic environment. These 
intramembrane interactions are highly 
conserved across species and across differ- 
ent antibody classes and explain why this 
region is so critical for BCR assembly (5). By 
contrast, the cytoplasmic tails of the BCR 
are invisible in the reconstruction, indicat- 
ing that they are flexible. 

The structures also reveal unexpected 
features. They demonstrate the central im- 
portance of the linker peptides. The tightly 
knitted linkers are stabilized in specific 
positions by a network of interactions, sug- 
gesting that they are essential in keeping 
the BCR together and in the correct ori- 
entation. For example, one of the antibody 
linkers threads in between Iga and Ig. 
Therefore, the BCR complex must assemble 
from the individual chains while they are 
still folding and will be difficult to disman- 
tle once fully formed. 

Although the interactions within the 
membrane are conserved between the IgM 
and the IgG BCRs, the two BCRs differ in 


The structure of the BCR 


Ma et al. and Suet al. present cryo-electron 
microscopy structures of the immunoglobulin M (IgM) 
B cell receptor (BCR), and Ma et al. also report that of 
the IgGl BCR. The membrane antibody, with two 


fragment antigen-binding (Fab) arms connected through 
hinges to one fragment crystallizable (Fc) leg, is linked to 


a transmembrane domain that threads between Iga 
and IgB. The cytoplasmic tails of the BCR are not seen 
in the structures, and the Fab is not seen in IgG1. 
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their extracellular domains. The IgM mem- 
brane antibody sits closer to the membrane 
and interacts with Iga-IgB on its side. By 
contrast, the IgG is spaced further out and 
sits on top of Iga-Ig8. This structural dis- 
parity may underlie some of the signaling 
differences between IgM and IgG BCRs. 
It also predicts that BCRs of other classes 
adopt distinct structures, which may be 
important for class-specific functions and 
therapeutic targeting. Additionally, the Fab 
arms of the IgM BCR show only limited 
flexibility compared with soluble IgM (13). 
Apparently, association with Iga alters the 
position of one of the Cu3 domains in the 
IgM Fc, which in turn stabilizes the hinge 
movement of the Cu2 domains. This sug- 
gests that in the IgM BCR, there is a com- 
munication between the Fab movement 
and the Iga-Ig8 heterodimer. It will be in- 
teresting to see whether such communica- 
tion is involved in signal transmission after 
antigen binding and how this may contrast 
with the IgG BCR, in which the Fab arms 
move freely. 

Where might these pioneering BCR struc- 
tures lead us? One hope is that cryo-EM 
can be used to solve the structures of the 
antigen-engaged BCR complexes. Activated 
BCR structures could address the mecha- 
nisms by which antigens stimulate B cells 
and guide the engineering of vaccines for 
optimal B cell activation (1/4). Another area 
of interest is to prevent unwanted activation 
of B cells. Rational approaches to inhibit 
the BCR may help to quell the recognition 
of self-antigens by B cells in autoimmunity. 
Similarly, targeting the BCR can be benefi- 
cial in chronic lymphocytic leukemia and 
some types of B cell lymphomas in which 
BCR signaling drives malignant prolifera- 
tion (5). Inhibiting intracellular signaling 
or marking the diseased cells for elimina- 
tion will extend the range of therapeutic op- 
tions in these diseases. & 


REFERENCES AND NOTES 


1. Q Suetal., Science 377, 875 (2022). 
2. X.Maetal., Science 377, 880 (2022). 
3. J.Hombach, T. Tsubata, L. Leclercq, H. Stappert, M. Reth, 
Nature 343,760 (1990). 
4. K.S.Campbell, J.C. Cambier, EMBO J.9,441 (1990). 
5. A.C.Shawetal., Cell 63, 381 (1990). 
6. A.R.Venkitaraman, G.T. Williams, P. Dariavach, M.S. 
Neuberger, Nature 352, 777 (1991). 
7. G.D.Victora, M.C. Nussenzweig, Annu. Rev. !mmunol. 
40, 413 (2022). 
8. K.-P.Lam,R. Kiihn, K. Rajewsky, Cell 90, 1073 (1997). 
9. M.Reth, J.Wienands, Annu. Rev. lmmunol.15, 453 
(1997). 
10. W.W.A.Schamel, M. Reth, /mmunity 13,5 (2000). 
11. P.Tolar, J. Hanna, P. D. Krueger, S. K. Pierce, Immunity 30, 
44 (2009). 
12. B. Treanor et al., /mmunity 32,187 (2010). 
13. Y.Lietal., Science 367, 1014 (2020). 
14. A.P.West Jr. etal., Cell 156, 633 (2014). 
15. R.M. Young, A.L. Shaffer Ill, J.D. Phelan, L.M. Staudt, 
Semin. Hematol.52, 77 (2015). 


10.1126/science.add8065 


science.org SCIENCE 


GRAPHIC: V. ALTOUNIAN/SCIENCE 


VIEWPOINT: COVID-19 


T cell immunity to COVID-19 vaccines 


T cell immunity may be critical for long-term protection by COVID-19 vaccines 


By E. John Wherry! and Dan H. Barouch? 


he development of multiple COVID-19 

vaccines in record time is a major bio- 

medical achievement, but mechanistic 

immune correlates of vaccine protec- 

tion remain to be determined. Most 

studies on COVID-19 vaccines have 
focused on neutralizing antibody (NAb) re- 
sponses, with little emphasis on cellular im- 
munity. However, accumulating data suggest 
that T cell responses play an important role in 
vaccine protection against severe COVID-19 
disease, particularly against viral variants 
that partially escape from recognition by 
NAbs. These insights have implications for 
using current COVID-19 vaccines and for 
developing next-generation vaccines against 
COVID-19 and other infectious diseases. 

Protective immunity can be induced by 
vaccination or infection and is mediated by 
two arms of the adaptive immune system: hu- 
moral immunity mediated by antibodies and 
memory B cells, and cellular immunity that 
includes helper CD4* T cells and cytotoxic 
CD8* T cells. Antibodies block infection by 
binding virus and preventing viral entry into 
host cells and are correlates of protection for 
many vaccines. For severe acute respiratory 
syndrome coronavirus-2 (SARS-CoV-2), anti- 
bodies can prevent acquisition of infection if 
they are present at high enough concentra- 
tions, as demonstrated by immunoglobulin G 
(IgG) transfer studies in macaques (1). 
Memory T cells can provide an impor- 

tant additional layer of protective immunity. 
Because T cells do not recognize infecting 
viruses until they have entered the host cell, 
the mechanisms of protection by T cells are 
different from those of antibodies. For ex- 
ample, T cells cannot prevent host cells from 
initially becoming infected, but they can re- 
spond rapidly once infection has occurred 
to limit virus replication and spread (see the 
figure). Thus, T cell immunity likely does not 
prevent acquisition of infection. However, 
emerging evidence supports a key role for 
SARS-CoV-2-specific T cell immunity in pro- 
tection from severe disease. 


Unstitute for Immunology and Department of Systems 
Pharmacology and Translational Therapeutics, University 
of Pennsylvania Perelman School of Medicine, Philadelphia, 
PA, USA. Center for Virology and Vaccine Research, Beth 
Israel Deaconess Medical Center, Harvard Medical School, 
Boston, MA, USA. Email: wherry@pennmedicine.upenn.edu; 
dbarouch@bidmc.harvard.edu 


SCIENCE science.org 


Immunological memory to SARS-CoV-2 
from vaccination or infection can protect the 
host through multiple mechanisms. If virus 
breaches NAb defenses in the upper respi- 
ratory tract, protection from severe disease 
could still be mediated if immune mecha- 
nisms prevent virus spread to the lower res- 
piratory tract and control virus replication 
in the lungs. Such protection can involve 
antibodies, but T cells are ideally suited to 
limit virus replication by eliminating virus- 
infected cells. Thus, although an ideal vac- 
cine would prevent acquisition of infection, 
immune responses that operate rapidly and 
efficiently after initial infection could prevent 
progression to severe disease. Indeed, even 
for vaccines with outstanding and durable 
efficacy, such as the measles vaccine, clinical 
protection may be mediated by rapid control 
of virus replication in infected cells and pre- 
vention of disease progression (2). 

For SARS-CoV-2, there are four main goals 
of vaccination: protection from acquisition of 
infection; prevention of transmission; pro- 
tection from severe disease; and prevention 
of Long Covid. When vaccines were initially 
rolled out, the early high NAb titers following 
immunization led to hopes that the vaccines 
would largely block infection and transmis- 
sion. However, two issues have emerged that 
question whether these goals are still achiev- 
able with current vaccines. The high NAb 
titers observed at peak immunity several 
weeks after messenger RNA (mRNA) vacci- 
nation wane rapidly and substantially, often 
within 4 to 6 months (3, 4). NAb titers after 
adenovirus type 26 (Ad26) vector vaccina- 
tion are more durable over time but peak at 
lower levels (4). The serum concentration of 
NAbs needed to provide durable protection 
from infection and transmission remains un- 
clear but is likely much higher for the current 
highly infectious SARS-CoV-2 variants. 

Additionally, SARS-CoV-2 variants such 
as Omicron show increased transmissibility 
and substantial escape from NAb responses, 
both of which reduce the ability of vaccine- 
induced NAbs to block infection (5, 6). The 
combination of waning antibody levels and 
the emergence of viral variants has now 
forced a recalibration of the expectations of 
current COVID-19 vaccines. Protective effi- 
cacy against acquisition of infection appears 
to be transient and limited against Omicron, 
even after third and fourth mRNA vaccine 
boosters (7, 8). By contrast, protection against 


severe disease has largely been maintained in 
otherwise healthy individuals. For example, 
data from South Africa during the Omicron 
surge have shown that both BNT162b2 and 


Ad26.COV2.S vaccines still provided robust 


protection against hospitalization even in 
the absence of high-titer NAbs (5, 6, 9). These 
data suggest that other mechanisms protect 
from severe disease. 

Multiple layers of the immune system 
contribute to immunological memory and 
protective immunity to viruses. Antibody re- 
sponses, produced by B cells, are generated 
in two waves. Short-lived plasma cells are 
usually generated without entering a ger- 
minal center (GC) and function to rapidly 
produce high titers of low-quality antibodies. 
A second set of B cells enters the GC, where 
they undergo somatic hypermutation and af- 
finity maturation to generate higher-quality 
antibodies. Long-lived plasma cells emerge 
from the GC and migrate to the bone marrow, 
where they produce high-quality, affinity-ma- 
tured antibodies. The other output of the GC 
is a pool of memory B cells, the cellular stor- 
age unit of high-quality antibody sequences. 
These cells persist, and upon reinfection (or 
booster vaccination), can rapidly convert into 
plasma cells to make new antibodies and 
seed new GCs to reinitiate antibody affinity 
maturation. For COVID-19 vaccines, although 
serum antibody titers wane rapidly, memory 
B cells are highly durable and may contribute 
to protection from disease along with mem- 
ory T cells (10, 11). 

T cells recognize short peptides presented 
on the cell surface in complex with human 
leukocyte antigen (HLA) class I or class IT 
molecules. Upon recognition of their cognate 
peptide presented by HLA molecules, mem- 
ory T cells can rapidly elaborate effector func- 
tions to suppress viral replication, limit infec- 
tion, and prevent spread within the host. For 
example, CD8* T cells directly kill infected 
cells and produce antiviral cytokines, as well 
as inflammatory molecules that recruit addi- 
tional immune cells to sites of infection. CD4* 
T cells contribute to various aspects of immu- 
nity, including supporting B cell responses 
in the GC; some CD4* T cells can also have 
direct antiviral properties similar to those 
of CD8* T cells. Once generated, memory T 
cells can be stably maintained for decades in 
humans, and SARS-CoV-2-specific memory T 
cells durably persist after vaccination or in- 
fection (11, 12). 
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Because antibodies and T cells recog- 
nize virus and contribute to protection by 
different mechanisms, the impact of viral 
mutations on immune escape is distinct. 
NAbs recognize conformational epitopes 
on viral proteins and typically mediate 
their effects by blocking engagement of a 
viral coat protein with the host cell entry 
receptor. In the case of SARS-CoV-2, NAbs 
bind to the spike protein receptor bind- 
ing domain (RBD) and N-terminal domain 
(NTD), blocking engagement with the host 
receptor, angiotensin-converting enzyme 2 
(ACE-2). Mutations in the spike RBD and 
NTD can substantially influence antibody 
binding. Immune pressure from antibod- 
ies is likely driving evolution of the spike 
protein, resulting in incomplete neutraliza- 
tion of new viral variants by NAbs induced 
by vaccination or infection (5, 6). Similarly, 
multiple therapeutic monoclonal antibod- 
ies have lost efficacy against current SARS- 
CoV-2 variants. 

By contrast, T cells recognize short, 8- 
to 15-amino acid linear peptides, which 
are not limited to the spike RBD and NTD 
domains, where most mutations occur. As 
a result, T cell responses remain largely 
intact against variants such as Omicron, 
with >80% of T cell epitopes conserved 
across variants (12, 13). Moreover, if escape 
from a T cell epitope occurs, differences in 
HLA-peptide presentation suggest that a 
mutation that causes escape from T cell im- 
munity in one person is unlikely to do so 
in another person. Overall, emerging viral 
variants substantially affect antibody neu- 
tralization but so far have had a minimal 
impact on T cell responses. 

What is the evidence that memory T 
cells contribute to protective immunity 
to SARS-CoV-2? In cancer patients with B 
cell deficiencies that experience COVID-19, 
CD8* T cell responses correlated with 
milder disease (J4). CD8* T cell depletion 
studies in macaques have demonstrated a 
contribution of CD8* T cells to protection 
from SARS-CoV-2 challenge (7). In addi- 
tion, vaccine failures against experimental 
Omicron challenge in macaques were as- 
sociated with a lack of Omicron-specific 
CD8* T cells, despite moderate Omicron 
NAb titers (75). Moreover, robust protec- 
tion against severe disease in the absence 
of high NAb titers (5, 9) suggests a role for 
T cell responses. There have been surges in 
SARS-CoV-2 infections driven by Omicron 
subvariants that largely escape NAb re- 
sponses, but hospitalization, intensive 
care unit admission, and death rates have 
not increased proportionally. The discon- 
nect between infection and severe disease 
suggests a substantial level of population 
immunity, which likely includes both hu- 
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Neutralizing antibodies and T cells 
in COVID-19 vaccine efficacy 


When vaccines induce high titers of neutralizing 
antibodies (NAbs), severe acute respiratory syndrome 
coronavirus 2 (SARS-CoV-2) infection of the upper 
respiratory track is blocked. But, when NAb titers 
wane or the virus evades antibody recognition, robust 
T cell responses block progression of infection to the 
lower respiratory tract. Disease progression occurs 
with low titers of NAbs and weak T cell responses. 
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moral and cellular immunity. Together, 
these observations support a role for cel- 
lular immunity, and particularly CD8* 
T cell responses, in contributing to vac- 
cine protection against severe COVID-19. 
Furthermore, the durability and reactivity 
of CD8* T cells against variants (4, 1/—13) 
suggest their relevance for preventing se- 
vere disease against viral variants that in- 
creasingly escape NAbs. 

A key question moving forward is how 
to optimize T cell immunity with COVID-19 
vaccines. Boosting with current mRNA 
or vector-based vaccines should increase 
spike-specific cellular immunity, but better 
understanding of which T cell subpopula- 
tions protect against disease is needed. 
Adding into next-generation vaccines other 


immunogens, such as SARS-CoV-2 nu- 
cleocapsid or membrane proteins, or con- 
served regions, represents another strategy 
to broaden cellular immunity. Such ap- 
proaches may also contribute to the devel- 
opment of pan-betacoronavirus vaccines. 
Whether vaccine-induced memory T cells 
establish long-term residence at sites of 
mucosal entry for durable protection is also 
important to address. 

Another knowledge gap is whether up- 
dated booster vaccines will improve clini- 
cal efficacy compared with current vaccine 
boosters. Recent clinical data suggest that 
boosting with bivalent mRNA vaccines ex- 
pressing the ancestral and Omicron BA.1 
spike resulted in less than twofold higher 
Omicron BA.1 NAb titers compared to 
boosting with vaccines based on the ances- 
tral spike. T cell responses have not yet been 
reported with these updated vaccines. The 
clinical relevance of the modestly increased 
NAb titers for protection against infection 
with Omicron remains unclear. 

Future research should define the precise 
mechanisms by which T cells contribute to 
vaccine efficacy, including the role of mu- 
cosal resident T cells, optimal memory T 
cell differentiation states, and the role, if 
any, of SARS-CoV-2 escape from T cell im- 
munity. T cell responses should be included 
in studies of immune correlates of protec- 
tion. Moreover, future studies should define 
optimal methods for monitoring T cell re- 
sponses. A deeper understanding of the role 
of T cell immunity for protection against 
SARS-CoV-2 infection and disease should 
provide a foundation for improving the use 
of current vaccines and the development of 
next-generation vaccines. 
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Transforming urban 


environments 


Resilient cities of the past hold lessons for creating 
adaptable communities of the future 


By Margaret A. Palmer 


ities are highly dynamic socio-envi- 

ronmental systems in which people, 

communities, and institutions interact 

with mosaics of green space, contami- 

nated brown fields, and remnants of 

once thriving ecosystems. Over diverse 
time scales, these systems can thrive, decline, 
or reinvent themselves across multiple spa- 
tial scales. And while economic forces, envi- 
ronmental threats, urban unrest, 
and other challenges may result in 
the demise of some cities, others 
are able to persist—often by rein- 
venting themselves. 

A new book titled If the Past 
Teaches, What Does the Future 
Learn? argues that communities 
that were able to pivot in the face 
of change in the past provide key 
lessons in how to strengthen cit- 
ies’ adaptability moving forward. 
The book’s authors suggest that 
these historical examples can 
“inspire new possibilities, provide 
useful guiding principles, and suggest solu- 
tions to new challenges” cities face today. 

The book—resulting from activities con- 
ducted as part of the Integrated History and 
Future of People on Earth (IHOPE) project 
and edited by archaeologists Carol Crumley 


2022. 
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If the Past 


Teaches, What Does 
the Future Learn? 
John T. Murphy and 

Carole L. Crumley, Eds. 
TU Delft Publishing, 


and John Murphy—opens with an engaging 
chapter on what the authors call “durability,” 
the capacity of a city to either adapt or suc- 
cumb to change (1). To illustrate the concept, 
they carefully select case studies that capture 
the variability in cities around the world and 
through time. From highland Ethiopia to the 
Maya lowlands, Atlantic Europe, and south- 
ern Mesopotamia, the authors use these 
places, their landscapes, and their regional 
settings to “build a bridge from archaeology 
to mainstream architectural and 
design theory.” Although there are 
examples of extant cities that have 
prehistoric antecedents (e.g., Phoe- 
nix, Arizona) and exhibit durabil- 
ity, archaeological examples are the 
primary focus. 

“How can we transform urban 
environments to encourage dura- 
bility and mediate the social price 
of myriad risks and vulnerability?” 
ask the book’s authors. They argue 
that there are guiding management 
principles that have led to urban 
durability in the past that we should 
apply today. These include maximizing diver- 
sity in all its forms—natural resources, peo- 
ple, ideas, views, institutions—which allows 
for flexibility and, in turn, permits adaptation 
to changing forces and challenges. 

The authors also advocate for the integra- 
tion of the city in its natural setting—that is, 
ensuring that it optimizes inherent natural 
cycles and reuse principles, thereby tapping 
into the self-sustaining processes built into 
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Iranian wind towers, an ancient and natural cooling 
system, could inform modern urban planning. 


nature. Doing so avoids the overexploita- 
tion and waste of resources that can lead 
to fragility. 

They encourage cities to maximize multi- 
functionality by capitalizing on fundamen- 
tal ecological linkages that contribute to 
both social and environmental sustainabil- 
ity. For example, maintaining river-flood- 
plain connectivity for access to drinking 
water, food, and socio-cultural practices 
not only ensures that communities have 
clean water, it also protects the rivers’ self- 
renewing nutrient and sediment cycles, 
which support food webs and riparian 
plants that minimize harmful floods. 

Cooperation and coordination among peo- 
ple and institutions (formal and informal) 
at multiple levels is another tenet of adapt- 
able cities, argue the authors, helping com- 
munities avoid problems that can arise when 
sharing common-pool resources. This idea 
harkens back to one of the arguments made 
by Nobel Prize-winning economist Elinor 
Ostrom: that cities can successfully manage 
resources through collective ownership (2). 

The book applies thinking that is most 
akin to the notion of adaptive cycle resil- 
ience advanced by Lance Gunderson and 
Buzz Holling, who also include the idea of 
major reorganization as a necessary compo- 
nent of socio-ecological resilience (3). (The 
authors of this book use the term “durabil- 
ity” rather than resilience, but the senti- 
ment is seemingly the same.) 

Ultimately, I found the book to be a clear, 
informative, and useful resource that I plan 
to use to teach about socio-environmental 
systems. It successfully makes the case that 
historical cities that exhibited durability 
can offer us valuable insights into how to 
build modern cities that can reinvent them- 
selves in the face of change, providing a 
collection of interesting case studies and a 
look at how archaeologists think about hu- 
man-nature interactions. An easy-to-read 
book, it is suitable as inspiration for urban 
design experts and for teachers and schol- 
ars of socio-ecological sustainability. ® 
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The myth of meritocracy in scientific institutions 


Inaccurate ideas about objectivity and merit perpetuate biases and inequality in academia 


By Leanne Son Hing 


ritten by sociologists Mary 

Blair-Loy and Erin Cech, Mis- 

conceiving Merit investigates 

how STEM (science, technol- 

ogy, engineering, and math) 

faculty in the United States un- 
derstand scientific merit and how these 
understandings can lead to subtle biases 
and perpetuations of inequality. The au- 
thors propose that cultural processes and 
beliefs concerning devotion to work and 
the independence of the scientific enter- 
prise shape how STEM faculty view, treat, 
evaluate, and reward their peers. Critically, 
although STEM professionals de- 
fine excellence as groundbreaking 
research conducted by means of 
objective methods, the pair find 
that faculty often fail to consider 
how their professional culture 
can lead to bias against certain 
groups, which can ultimately un- 
dermine knowledge production 
and its application. 

Written for nonexperts, the book 
reports on a comprehensive study of 
STEM faculty at an anonymized US 
university that is large, prestigious, 
and research intensive. Complex 
phenomena are explained in an 
easy-to-grasp fashion, and rigorous 
data are used to provide evidence for 
the authors’ assertions. 

Researchers will likely see their 
own experiences in the book’s rich 
descriptions of the priorities and 
pressures of a competitive academic en- 
vironment. These descriptions, paired with 
ample quotes from interviewees, make for 
compelling reading, painting a portrait of 
STEM faculty who work all the time, are 
highly engaged with their work, struggle to 
balance personal and professional obliga- 
tions, and feel that they are always behind 
and never enough. 

“Most want objective, neutral STEM 
work without reference to the identities of 
individual scientists. But there’s a catch,” 
argue Blair-Loy and Cech. “Historically— 
and today—that unmarked, unobtrusive 
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identity of scientist can be embodied only 
by white, heterosexual men.” The authors 
weave together a compelling case for this 
argument by drawing on findings from mul- 
tiple sources of data, including university 
data (e.g., rank, pay, group memberships, 
research funding) for all 502 STEM faculty 
in their case university and indicators of 
performance (e.g., number of publications, 
citation counts). They match these data to 
survey responses from an impressive 53% 
of potential respondents, allowing them to 
make strong claims about generalizability. 
Drawing on these data, the authors found, 
for instance, that, although STEM faculty be- 
lieved that researchers who are mothers are 


Bias against underrepresented groups, including mothers, hinders innovation. 


not as devoted to their work as researchers 
who are fathers, the mothers they surveyed 
reported working as many hours per week 
as their male counterparts and performed 
at equal or higher levels (e.g., yearly publi- 
cation rates), despite spending many more 
hours a week performing childcare than 
male researchers who were also parents. 
Nonetheless, female researchers who were 
mothers earned less than their peers. 

The veracity of such findings might be 
called into question given the study’s small 
sample size, low reliability of measures, and 
lack of evidence of validation of measures. 
However, any potential weaknesses are more 
than made up for by additional data sources, 
including survey findings from more than 
7000 STEM faculty from universities across 


Misconceiving Merit 
Mary Blair-Loy and 

Erin A. Cech 

University of Chicago Press, 
2022. 232 pp. 


the US and qualitative interviews with 85 
STEM faculty from the authors’ case uni- 
versity, which provide a rich sense of the 
meaning making in which STEM researchers 
engage in their efforts to understand merit. 

One of the most compelling parts of the 
book details how the authors asked inter- 
viewees to reflect on a study published in 
Science, which found evidence of anti-Black 
bias in NIH funding rates, when 
controlling for key factors such as 
the applicants’ previous funding 
and research productivity (1). The 
participants’ responses revealed 
strong inclinations to reject this 
finding and to continue to justify 
the system as meritorious. Many 
called the study methodology into 
question and insisted that grant 
reviewers steadfastly ignore cues 
about applicants’ identities. Such 
assertions stand in stark con- 
trast with interviews conducted 
with researchers from marginal- 
ized groups that appear earlier in 
the book, in which they recount 
working to hide one’s identity 
and feeling excluded at work, and 
with other interviews that appear 
throughout the book, in which 
straight white male researchers 
expressed discomfort with interacting with 
colleagues from marginalized groups. 

I wish the authors had spent more time 
laying out avenues for change, as this book 
will likely be of interest to university ad- 
ministrators, professional associations, and 
STEM thought leaders. However, they do 
provide some clear observations that STEM 
faculty and their institutions must recog- 
nize to move forward, including the fact that 
bias exists in this setting, that collaborative 
work and mentorship of students and junior 
colleagues is essential, and that a healthier 
work-family culture can benefit all. 
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Edited by Jennifer Sills 


Editorial Retraction 


On 22 August 2014, Science published the 
Research Article “Chemically mediated 
behavior of recruiting corals and fishes: A 
tipping point that may limit reef recovery” 
by D. L. Dixson et al. (/). On 17 February, 

we published an Editorial Expression of 
Concern (2) after being made aware of data 
manipulation accusations pertaining to this 
paper. In August, the University of Delaware 
informed us that the data in Figures 1A, 2, 
3, and 4 were questioned and that they no 
longer have confidence in the validity of the 
data. In agreement with the reeommenda- 
tion of the University of Delaware, Science is 
retracting the paper. 

H. Holden Thorp 

Editor-in-Chief 
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The Amur tiger needs 
transborder protection 


Of the nine subspecies of tiger, three—the 
Bali, Javan, and Caspian tigers—became 
extinct in the 20th century. The South China 
tiger, unique to China, became extinct in 
the wild in the 1990s (7). Human activities, 
including migration, war, wildlife trade 
that includes illegal sales, and deforesta- 
tion, caused their extinction (2-4). The 
remaining five tiger subspecies, which are 
all Endangered or Critically Endangered (5), 
continue to face these threats. Thanks to 
effective conservation policies, the popula- 
tion of one subspecies, China’s Amur tiger 
(Panthera tigris altaica), has grown sub- 
stantially in the past few years. However, 
the Amur tigers often cross China’s border 
into Russia, complicating protection efforts. 
China and Russia can better protect the 
Amur tiger by working together. 

The Amur tiger was listed in 2011 as 
Endangered by the International Union 
for Conservation of Nature (IUCN) (6). 
It is also a Class I key protected cat spe- 
cies in China due to its population decline 
(7). China has taken important steps to 
recover the population, such as bans on 
culling and deforestation, captive breeding, 
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and the establishment of tiger reserves in 
Heilongjiang and Jilin provinces (8, 9). As a 
result of these policies, the Amur tiger popu- 
lation in the wild, including in reserves, has 
increased from 27 in 2017 to more than 60 
in 2022 (10). However, when the tigers cross 
from China into Russia, chances of human- 
wildlife conflicts increase. 

To ensure consistent protection, Russia 
should implement the strategies that have 
worked in China. Both countries should also 
put in place penalties and strict enforce- 
ment to discourage the capture and hunt- 
ing of tigers for commercial gain (77). In 
addition, China and Russia each need a 
well-publicized wildlife compensation policy 
to provide reimbursement for property 
losses caused by tigers, mitigating the risk 
of ranchers killing tigers who attack or 
threaten livestock. News media and other 
educational outlets in both China and Russia 
should emphasize the importance of pro- 
tecting endangered species and preventing 
human-tiger conflicts. 

Given the threats to the remaining tiger 
populations, it is crucial to build on exam- 
ples of conservation success. The Amur tiger 
population has shown signs of recovery; 
transboundary cooperation could help save 
this vulnerable subspecies. 
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Genomic research data 
and the justice system 


In their Policy Forum “Get law enforce- 
ment out of biospecimen authentication” 
(17 June, p. 1274), D. J. H. Mathews and N. 
Ram propose an identification system for 
cells and tissues used in biomedical research 
that would be separate and distinct from the 
Combined DNA Index System (CODIS) used 
by the US criminal justice system. Their pro- 
posal could potentially create artificial silos 
between genomic data in the justice system 
and in biomedical research, making it inef- 
ficient and ultimately counterproductive. 
CODIS was purposefully designed to guar- 
antee that people could be identified while 
their personal data, such as health informa- 
tion, remained private (7). In this system, 
short tandem repeats (STRs) create a unique 
DNA fingerprint as an identifier for each 
person. These STRs were initially thought to 


Efforts by China and Russia could save the Amur tiger from extinction. 
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be unrelated to any phenotypic information, 
although evidence may now suggest other- 
wise (2). However, preserving anonymity 
remains a fundamental intent of the system. 
The design of CODIS reflects the fact that 
suspected criminals (3) and even prison 
inmates (4) retain some privacy rights under 
the US Constitution. 

Mathews and Ram acknowledge the 
economies of scale and the sound practical 
reasons for using the same genetic loci in 
research settings and in CODIS but argue 
that using alternative loci for tissue and 
cell-line authentication would be preferable. 
We disagree. The use of CODIS loci within a 
research-based authentication system—with 
the necessary safeguards in place, such as 
the penalties legislated under the Justice 
for All Act of 2004 (5)—would be unlikely 
to exacerbate problems of bias. This is espe- 
cially true given that DNA fingerprints are 
otherwise easily attainable: Currently, a del- 
uge of private genomic data has been made 
public through various direct-to-consumer 
platforms, and it is increasingly trivial to 
sequence and deanonymize DNA from tis- 
sues and cells. 

Mathews and Ram note that the criminal 
justice system is unwilling to share any DNA 
fingerprints for research purposes, which 
makes sense given the strict criminal penal- 
ties for the misuse of CODIS data (6). Thus, 
under Mathews and Ram’s proposed system, 
research into determining problematic cor- 
relations between phenotypes and CODIS 
STRs will ultimately be limited, as these data 
will be collected primarily by the justice sys- 
tem, not by bioscientists. 

We believe that using an alternative STR 
system would be a lost opportunity. With 
research-based authentication using the 
same loci as CODIS, the amount of useful 
data that could be collected and correlated 
with the CODIS STRs will grow as we 
expand research into these STRs. Collecting 
such data could provide an invaluable 
service in either supporting or refuting 
concerns that CODIS may not be as privacy 
friendly as first thought. If the resulting 
research were to show strong phenotypic 
correlations with CODIS loci, we would then 
demand new loci not only for researchers, 
as Mathews and Ram suggest, but for the 
criminal justice system as well. 
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Response 


Greenbaum and Gerstein argue that we 
should embrace the dissolving of silos 
between contexts and data sources by using 
Combined DNA Index System (CODIS) 
markers for research authentication pur- 
poses. However, we believe the silos that 
isolate research data from law enforcement 
data and activities are both justified and 
critical. The research community must take 
steps to reduce the risk that these silos fall. 

Greenbaum and Gerstein assert that 
we should instead maximize the amount 
of research data that can be collected and 
compared with the CODIS short tandem 
repeats (STRs). As they rightly note, the 
assumption that CODIS STRs and phe- 
notypic information are not related is 
increasingly untenable (7). However, this 
observation does not necessitate that 
researchers use CODIS STRs specifically 
for authentication. We support Greenbaum 
and Gerstein’s call for further research 
on CODIS STR data, and we trust that 
researchers interested in such work do not 
intend to “misuse” the data. But CODIS- 
focused research should be undertaken 
directly and described to research partici- 
pants as such, rather than occurring as a 
by-product of authentication standards. 
Whereas the Justice for All Act might 
apply to certain researchers funded by 
the Department of Justice, its protec- 
tions do not appear to cover any genetic 
data outside the criminal legal system. 
Furthermore, law enforcement’s refusal to 
permit access to CODIS itself should not 
be used to argue for increasing risks to all 
research participants. 

Greenbaum and Gerstein also argue that 
avoiding CODIS markers for authentication 
is unnecessary because, effectively, the cat 
is out of the bag. Given the preponderance 
of direct-to-consumer platforms and the 
ease with which data from tissues can be 
deanonymized, Greenbaum and Gerstein 
suggest that CODIS marker-based authen- 
tication standards do little harm. 

We disagree. As we explain in our Policy 
Forum, the risk of attracting law enforce- 
ment interest to research data increases 
when the data are available in a recogniz- 
able way, which is precisely what CODIS 


marker-based authentication standards do. 
Data produced pursuant to these standards 
may be, and indeed likely are, even easier 
for law enforcement to exploit than the 
sprawling family trees that investigative 
genetic genealogy with consumer genetic 
data requires. 

Finally, Greenbaum and Gerstein do 
not respond to the other serious harms 
our Policy Forum identified. Using CODIS 
markers could undermine the trust and 
subsequently recruitment of research par- 
ticipants, including those from historically 
marginalized and excluded groups (whose 
trust may be particularly undermined when 
research uses the language and tools of law 
enforcement), and impose additional poten- 
tial harms on already vulnerable popula- 
tions, including children (whose inclusion in 
genetic research is predicated on a “minimal 
risk” analysis that may no longer be tenable). 
The risks are substantial, and the benefits of 
both biospecimen authentication and scien- 
tific research can be secured in ways that are 
less likely to erode trust in science. 
Debra J. H. Mathews?2* and Natalie Ram? 
1Berman Institute of Bioethics, Johns Hopkins 
University, Baltimore, MD 21205, USA. 
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Comment on “The missing ocean plastic sink: 
Gone with the rivers” 

Lei Mai, Xiang-Fei Sun, Eddy Y. Zeng 

Weiss et al. (Reports, 2 July 2021, p. 107) 
incorrectly conclude that the residence time 
for floating microplastic stock at the ocean 
surface is ~2.4 years. We contend that this 
conclusion is fundamentally flawed, because 
the residence time is obtained through 
dividing a published oceanic stock of plastic 
debris by a doubtable river microplastic flux 
estimated by the authors. 

Full text: dx.doi.org/10.1126/science.abl6433 


Response to Comment on “The missing ocean 
plastic sink: Gone with the rivers” 


Lisa Weiss and Wolfgang Ludwig 

Mai et al. are mistaken in their assertions 
that we incorrectly calculated the residence 
time for floating microplastic stock at the 
ocean surface and that most of our results 
are not novel. Their claim that our field-mea- 
sured data and methods were not rigorous is 
wrong, as shown by a more careful consider- 
ation of what was done. 

Full text: dx.doi.org/10.1126/science.abm7305 
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EVOLUTIONARY ECOLOGY 
Optimized strategies for life 


etabolic theory posits that physical con- 

straints on energy uptake and allocation 

drive biological processes. This theory 

predicts broad ecological patterns such as 

the observed allometric scaling relation- 
ship between animals’ metabolic rate and body 
size. White et al. developed a new theory showing 
that such patterns can also be explained by evolu- 
tion acting concurrently on metabolism, growth, 
and reproduction. Using data from over 10,000 
species from 12 animal phyla, the authors found 
support for the prediction that species’ lifetime 
reproduction is optimized, with growth and repro- 
duction both increasing with metabolic level. This 
work proposes a bridge between metabolic and 
life history theory, two basic ways of understand- 
ing ecological patterns. —BEL 


Science, abm7649, this issue p. 834 
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Theory helps to explain observed ecological and metabolic 
patterns such as the many sizes and life histories of fish. 


SYNTHETIC BIOLOGY 
Decoding a multisite 


mechanism 


A multisite phosphorylation 
mechanism defines a complex 
signaling system in which the 
countervailing influences of the 
phosphorylation of more than 
one site create a highly nuanced 
cellular response, in contrast 

to the simple on-off switch of 
single-site phosphorylation. 
Faustova et al. have unraveled 
the “code” for one such system, 
the signals from cyclin-depen- 
dent kinases controlling nuclear 
localization and nuclear export. 
Controlling the phosphorylation 
sites, cyclin-specific short linear 
motifs, phosphopriming, and 
other manipulations enabled 
synthetic alteration of the 
observed signaling with control 
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of the delay in the responses, 
the magnitude of responses, 
and the localization dynamics. 
—PLY 
Sci. Adv. 10.1126/ 
sciadv.abp8992 (2022). 


NEURODEVELOPMENT 
Allocating sensory 
responsibilities 

During late embryogenesis, the 
developing mouse brain has 
inputs from both visual and 
sensory systems that activate 
overlapping regions of the 
cortex. Guillam6dn-Vivancos et 
al. now show how spontane- 
ous activity in the perinatal 
retina tunes the visual cortex to 
specialize in interpreting retinal 
activity as vision. Without such 
spontaneous retinal activity, 


this part of the brain will take 
on somatosensory responsi- 
bilities instead. This handshake 
between the sensory system 
and the cortical recipient is 
established during a limited 
time in the perinatal phase of 
development. —PJH 

Science, abq2960, this issue p. 845 


QUANTUM PHYSICS 
Combining lattices 
and tweezers 


Optical lattices have been used 
as a platform for quantum simu- 
lation for the past two decades. 
More recently, arrays of optical 
tweezers, which have the advan- 
tage of rapid reconfigurability, 
have risen to prominence. Young 
et al. combined these tools to 
perform large-scale quantum 
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walks of strontium-88 atoms 
prepared in optical tweezers, 
and then implanted into the 
sites of an optical lattice. The 
combined platform holds prom- 
ise for applications in quantum 
science. —JS 

Science, abo0608, this issue, p.885 


METALLOENZYMES 


Caught N the act 


Although the nitrogen-fixing 
enzyme nitrogenase has been 
studied extensively by x-ray 
crystallography, enzymes 

in crystals cannot move or 
form transient complexes 

as they would in solution. 
This is a particular problem 
for nitrogenase because the 
protein that harbors the site 
of nitrogen reduction requires 
a dynamic interaction with 
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an ATP-hydrolyzing sub- 
unit. Rutledge et al. used 
cryo-electron microscopy to 
examine nitrogenase samples 
prepared under turnover 
conditions with abundant ATP 
and the chemical reductant 
dithionite. They observed an 
asymmetric complex of the 
two proteins and, correspond- 
ingly, differences between 
subunits at the interface and 
near the key nitrogenase 
cofactor, FeMoco. These 
structures generate many 
new questions about electron 
delivery and coupling between 
the proteins, which are key to 
understanding this essential 
reaction. —MAF 

Science, abq7641, this issue p. 865. 


Chip-less electronic skin 
Flexible electronic materials, 
or e-skins, can be limited 

by the need to include rigid 
components. A range of 
techniques have emerged to 
bypass this problem, includ- 
ing approaches for wireless 
communication and charg- 
ing based on silicon, carbon 
nanotubes, or conducting 
polymers. Kim et a/. show that 
epitaxially grown, single-crys- 
talline gallium nitride films 

on flexible substrates can be 
used for chip-less, flexible 
e-skins. The main advantage 
is that the material is flexible 
and breathable, thus providing 
better comfort. The devices 
convert electrical energy 

into surface acoustic waves 
using a piezoelectric resona- 
tor. The resonator is sensitive 
to changes in strain, mass 
changes due to the absorption 
or loss of ions, and ultraviolet 


om: 


Anew flexible film may be of use in 
wearable electronics. 


830 


light, all of which can be used 
for different sensing measure- 
ments. —MSL 

Science, abn7325, this issue p.859 


Contextually 
complicated CAFs 
Cancer-associated fibroblasts 
(CAFs) are often implicated 
in drug resistance and tumor 
progression, but they can 
also be associated with tumor 
suppression. Remsing Rix et al. 
discovered another twist in the 
role of CAFs in drug efficacy: 
The same CAFs that induced 
drug resistance in some lung 
cancer cell lines promoted 
drug sensitivity in others. In 
certain tumors, drug-induced 
compensatory signaling was 
attenuated by certain proteins 
secreted by CAFs, thus enhanc- 
ing or restoring drug efficacy in 
treatment-naive and drug- 
resistant cells. —LKF 

Sci. Signal. 15, eabj5879 (2022). 


Boosting a weakened 
response 


The Omicron strain of severe 
acute respiratory syndrome 
coronavirus 2 (SARS-CoV-2) 
was first detected in 2021 and 
is now dominant across the 
globe. Bowen et al. show that 
the Omicron sublineages BA.1, 
BA.2, BA.2.12, and BA.4/5 have 
increased affinity for the ACE2 
receptor on human cells, but 
show decreased cell-cell fusion. 
The authors used pseudovirus 
to measure neutralizing activity 
against Omicron sublineages 
in plasma from individuals who 
were previously infected or 
vaccinated with one of seven 
vaccines, all based on the 
Wuhan-Hu-l variant. For both 
vaccinees and convalescents, 
there was severe dampening of 
neutralizing activity. Boosting 
with either the same vaccine 
used initially or one of the other 
Wuhan-Hu-1-based vaccines 
increased neutralization across 
all vaccines evaluated. —VV 
Science, abq0203, this issue p.890 
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How to nudge in 

the right direction 

Public health communications 
during the COVID-19 pandemic 
emphasized guidance from 
medical authorities to practice 
social distancing, masking, and 
handwashing. However, these 
authoritative messages (i.e., 
“don't break official COVID-19 
guidelines”) may not be the 
most effective way to moti- 
vate compliance because of 
psychological reactance. In 

a field study during January 
2021, Banker et al. analyzed 
data from over 57,000 travel- 
ers ina major US airport that 
presented color-coded visual 
indicators about crowdedness 
levels. The visual indicators 
were accompanied by short 
messages called nudges (e.g., 
“Protect yourself. Go toward 
green for less crowded areas”). 
Compared with authoritative 


nudges from the Centers for 
Disease Control and Prevention 
(CDC), the visual indicators that 
accompanied informal nudges 
motivated more social distanc- 
ing in the airport and may be 
more effective at preventing 
transmission of pathogens than 
more officious messages. —EEU 
Proc. Natl. Acad. Sci. U.S.A. 
119, e2116156119 (2022). 


-IEN 
Coronavirus in cortical 
astrocytes 


Many patients suffering with or 
having recovered from COVID- 
19 present with neurological 
symptoms. Are these symp- 
toms a result of direct infection 
of the nervous system or are 
they a secondary consequence 
of widespread inflammation 

in other tissues? Andrews et 

al. used primary cortical tis- 
sue from both the developing 
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brain and the adult brain, 
paired with stem cell-derived 
cortical organoids, to evalu- 
ate which human neural cell 
types can be directly infected 
by SARS-CoV-2. In primary 
cortical tissue cultures and 
cortical organoids exposed 
to SARS-CoV-2, the authors 
observed significant infection 
and viral replication specifically 
in immature and mature astro- 
cytes, which has downstream 
consequences for neurological 
function. —PRS 

Proc. Natl. Acad. Sci. U.S.A. 


119, 2122236119 (2022). 


CARBON CAPTURE 
Protecting quinones 
with ethanol 


Developing robust technol- 
ogy to capture carbon dioxide 
emissions from combustion 
streams is a key factor in 
mitigating climate change. 


SCIENCE science.org 


Most current approaches focus 
on amines that bind chemically 
to carbon dioxide and release it 
upon heating. An electrochemi- 
cal approach to toggling the 
catch-and-release modes could 
be more efficient. However, in 
such cases, capture agents such 
as reduced quinones may react 
with oxygen instead of carbon 
dioxide. Barlow and Yang report 
that added ethanol can head off 
this undesired pathway through 
hydrogen bonding, raising pros- 
pects for further investigation of 
this method. —JSY 

J.Am. Chem. Soc. 10.1021/ 

jacs.2c04044 (2022). 


PHOTOSYNTHESIS 


Building chlorophyll b 
Photosynthetic organisms 

use a wide range of pigments 
to absorb light energy and 
convert it into chemical energy. 
Chlorophyll is a familiar green 
pigment, but there are in fact 


STELLAR ASTROPHYSICS 
Zooming into the 
most massive star 


he most massive individual 

star known is R136al1 in the 

Tarantula Nebula, but its exact 

mass has been the subject of 

debate. R136al is located ina 
dense stellar cluster, so atmospheric 
distortion causes blending with other 
cluster members, contaminating 
measurements. Kalari et al. performed 
ground-based speckle imaging of 
R136al, achieving spatial resolution 
of 30 to 40 milliarcseconds, which is 
better than previous observations, 
including those taken from space. This 
resolves R136al from two contaminat- 
ing stars. Combining their photometric 
measurements with stellar models, 
the authors determined that R136al 
formed with 200 + 30 solar masses, 
which is lower than previous estimates 
but still the most massive individual 
star known. —KTS Astrophys. J. 
[Preprint] arXiv:2207.13078 (2022). 


The Tarantula Nebula, photographed here 
by NASA's Spitzer Space Telescope, 
hosts the most massive individual star known. 


several kinds of chlorophy!! 
with differences in peripheral 
functional groups that tune the 
absorption spectrum and redox 
potential. Liu et al. character- 
ized an oxidase involved in the 
synthesis of chlorophyll b, an 
essential, blue-shifted pigment. 
In addition to uncovering the 
details of the mechanism and 
specificity that may be useful in 
engineering efforts, the authors 
show that this enzyme can 
accept a substrate analog, lead- 
ing to a non-native chlorophyll 
derivative. -MAF 
ACS Cent. Sci. 10.1021/ 
acscentsci.2c00058 (2022). 


SIGNAL TRANSDUCTION 
Positioning signaling 
adaptors 


Phosphorylation of tyrosine 
residues in proteins is a com- 
mon mechanism for cellular 
signal transduction. Basant and 


Way show that positioning of 
phosphotryosine residues can 
have particular importance in 
signal output. Phophotyrosine 
residues are modular, in that 
they recruit adaptor proteins 
that bind phosphotyrosine- 
containing motifs. The authors 
focused on two phosphotyro- 
sine residues in the protein A36 
from Vaccinia virus that allows 
the virus to interact with a host 
cell's cytoskeleton to enable its 
transport and release. Results 
indicate that the positioning of 
the two sites is optimized for 
efficient signaling output. Such 
findings may be valuable for 
synthetic biology applications, 
in which such modules could 
be used to design signaling 
outcomes. —LBR 

eLife 11, e74655 (2022). 


MALARIA 
Plastic parasites elude 
control 


Malaria parasites, Plasmodium 
spp., are genetically plastic 
and resistant to control efforts 
despite—or possibly even 
because of—their complex life 
cycles. Public health interven- 
tions could therefore have 
effects beyond the induction 
of drug resistance. Early et al. 
applied a modeling approach, 
supported by genomics, 
to Plasmodium falciparum 
obtained in French Guiana to 
determine how control efforts 
might influence the parasite’s 
life cycle. Only sexual forms 
of P falciparum can infect 
mosquitoes and be transmitted 
to new hosts, but only asexual 
forms can replicate and amplify 
in humans. As the number of 
human cases declined, the 
authors found that the most 
strongly selected genes were 
enriched for the transcription 
factors required for sexual 
development. Higher rates of 
sexual conversion that lead to 
greater transmission potential 
from humans to mosquitoes, 
which could undermine the 
impact of control measures. 
—CA 

Proc. Natl. Acad. Sci. U.S.A. 

119, e2122165119 (2022). 


19 AUGUST 2022 « VOL 377 ISSUE 6608 831 


RESEARCH 


ALSO IN SCIENCE JOURNALS é fesoainiciadleick 


CORONAVIRUS 
T cells in vaccine 
immunity 
Whether a person has protective 
immunity derived from a vaccine 
is usually measured in terms of 
neutralizing antibody responses. 
Severe acute respiratory 
syndrome coronavirus 2 (SARS- 
CoV-2) variants have mutations 
that reduce neutralization by 
antibodies, which has increased 
breakthrough infections in vac- 
cinated people. However, these 
infections have not proportion- 
ally increased hospitalization 
and deaths, indicating that anti- 
bodies are not the whole story. 
Ina Perspective, Wherry and 
Barouch discuss the emerging 
role of T cell immunity in pre- 
venting severe COVID-19, even 
when it is caused by antibody- 
escape variants. Measuring T 
cell responses to vaccines will 
improve our understanding of 
their role in immune protection, 
especially in the long term after 
antibody waning. This work 
should also lead to optimized 
immune protection strategies, 
including the development of 
updated COVID-19 or pan-beta- 
coronavirus vaccines. —GKA 
Science, add2897, this issue p.821 


NEUROGENETICS 
Zooming in on 
dementia genes 


Neurodegenerative disorders 
are a major problem worldwide, 
and their incidence is continuing 
to increase. These are usually 
complex polygenic traits, and 
although many gene loci have 
been identified in associa- 

tion with these disorders, the 
functional roles of these loci 

are unclear and most of them 
fall within noncoding regions 

of the genome. Cooper et al. 
used massively parallel reporter 
assays to screen noncoding 
variants reported in genome- 
wide association studies of two 
neurodegenerative disorders, 
followed by functional validation 


829-B 


in neurons and microglia. With 
this combination of methods, 
the authors identified regula- 
tory variants in several different 
genes that play functional roles 
in disease pathogenesis, as well 
as interactions between these 
genes. —YN 

Science, abi8654, this issue p. 832 


PARKINSON’S DISEASE 
Uncovering a contributor 


Genome-wide association stud- 
ies (GWAS) have uncovered 
nearly 100 loci that contribute 
to risk for Parkinson's disease 
(PD), which affects an estimated 
6 million people worldwide. 
However, target genes and 
biological mechanisms associ- 
ated with these loci remain 
largely unexplored. Diaz-Ortiz 
et al. examined a PD GWAS risk 
locus on chromosome 7 linking 
it to the transmembrane protein 
Glycoprotein Nonmetastatic 
Melanoma Protein B (GPNMB). 
GPNMB was found to interact 
with alpha-synuclein (aSyn), 
the key protein that forms 
the pathological inclusions 
that characterize PD (see the 
Perspective by Mollenhauer and 
von Arnim). In cells, GPNMB was 
both necessary and sufficient 
for the uptake of fibrillar forms 
of aSyn and the subsequent 
development of aSyn pathology. 
—SMH 

Science, abk0637, this issue p. 833; 

see also add7162, p.818 


DEFLUORINATION 
Forever chemicals’ 
Achilles’ heel 


Per- and polyfluoroalky! 
substances (PFAS) have been 
referred to as “forever chemi- 
cals” because of their resistance 
to most biological and chemical 
degradation mechanisms. Most 
current methods use very harsh 
conditions to decompose these 
compounds. Trang et al. found 
that there is a potential weak 
spot in carboxylic acid—contain- 
ing PFAS: Decarboxylation in 
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polar, non-protic solvents yields 
a carbanion that rapidly decom- 
poses (see the Perspective by 
Joudan and Lundgren). The 
authors used computational 
work and experiments to show 
that this process involves 
fluoride elimination, hydroxide 
addition, and carbon-carbon 
bond scission. The initial decar- 
boxylation step is rate limiting, 
and subsequent defluorination 
and chain-shortening steps 
occur through a series of low 
barrier steps. The procedure 
can accommodate perfluoro- 
ether carboxylic acids, although 
sulfonic acids are not currently 
compatible. —MAF 

Science, abm8868, this issue p. 839; 

see also add1813, p.816_ 


PLANT SCIENCE 
More soybeans by 
light management 


Plants protect themselves 
from too much sun by dis- 
sipating excess light energy. 
Unfortunately, the switch from 
issipating light energy to using 
light energy for photosynthesis 
is not as nimble as the clouds 
moving across the sky. De Souza 
et al. applied a bioengineered 
solution that speeds up accom- 
modation by nonphotochemical 
quenching in soybeans, a widely 
cultivated and essential crop. In 
ield trials, seed yield increased 
nsome cases up to 33%. —PJH 
Science, adc9831, this issue p.851 


Q 
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NANOMATERIALS 
Ligand-driven 
metastability 


A high-pressure phase of a solid 
can persist at ambient pressure 
if there are kinetic barriers to its 
relaxation. Xiao et al. performed 
detailed mechanistic studies 

on the reversibility of four- to 
six-coordinate pressure-driven 
solid-phase transitions in well- 
controlled model systems of 
nanospheres or nanorods of 
cadmium selenide, cadmium 


sulfide, or both (see the 
Perspective by Mao and Lin). The 
choice of surface ligands could 
control the reversibility of the 
transformations. Interparticle 
sintering helped to eliminate 
crystal defects and relaxed 
lattice distortions from the high- 
pressure rock-salt structures to 
maintain their ambient-pressure 
metastability. —PDS 

Science, abq7684, this issue p. 870; 

see also add5433, p. 814 


THERMOELECTRICS 
A flexible power source 


Thermoelectric materials can 
harvest heat and turn it into 
power. Heat sources potentially 
include the heat generated 
by humans through wearable 
devices and might enable self- 
powering systems, but the lack 
of ductility for most thermoelec- 
trics poses a major problem. 
Yang et al. found a thermo- 
electric silver/copper—based 
semiconductor that also is duc- 
tile (see the Perspective by Hou 
and Zhu). This material allows 
for a thin, flexible device capable 
of producing power, even when 
adhered to a wrist. -BG 

Science, abq0682, this issue p. 854; 

see also add7029, p. 815 


IMMUNOLOGY 
The complete BCR comes 
into focus 


B cell activation is initiated by 
antigen binding to the B cell 
receptor (BCR), resulting in pro- 
liferation and differentiation into 
either plasma cells or memory 
cells. A better understanding 

of B cell signaling depends on 
amore complete picture of the 
BCR's structure. Two groups 
resolved cryo-electron micros- 
copy (cryo-EM) structures of 
the intact BCR complex, which 
comprises a membrane-bound 
immunoglobin (lg) linked to 

a transmembrane lg-a/lg-B 
(CD79) signaling complex (see 
the Perspective by Tolar and 
Pierce). Su et al. determined the 
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cryo-EM structure of the IgM 
BCR, and Maet al. describe cryo- 
EM structures of both IgG and 
IgM BCRs. Both studies reveal 
important features of the BCR's 
structure in its native state 
before engagement by antigen, 
and this work opens up the field 
to future studies of B cell activa- 
tion and function. —STS 
Science, abo3923, abo3828, 
this issue p..875, p. 880; 
see also add8065, p. 819 


AUTOIMMUNITY 
Turning down TRAF3 


revs up B cells 


TRAF3 is a cytoplasmic adap- 
tor protein involved in multiple 
intracellular signal transduction 
pathways. Rae et al. identified 
nine individuals with a previously 
undescribed monogenic immune 
dysregulatory syndrome caused 
by aloss-of-function mutation 
in one of their two TRAF3 alleles. 
Patients with this haploinsuf- 
ficiency syndrome exhibited 
B cell hyperactivity leading to 
hypergammaglobulinemia and 
autoimmunity, but they also dis- 
played increased susceptibility 
to recurrent bacterial infec- 
tions. Common human genetic 
variants of TRAF3 associated 
with lower gene expression were 
found to carry an increased 
risk of autoimmune disease 
and some B cell malignancies. 
These findings point to func- 
tional impairment of TRAF3 as 
a shared mechanism leading to 
B cell dysfunction, hyperactiv- 
ity, and disease in the presence 
of either rare loss-of-function 
germline mutations or much 
more common genetic variants. 
—IRW 

Sci. !mmunol. 7, eabn3800 (2022). 


AUTISM SPECTRUM 
Exploiting face 
processing 

The heterogeneity observed in 
patients with autism spectrum 
disorder (ASD) highlights the 
need for better patient strati- 
fication methods. Mason et al. 


evaluated the use of the speed 
of early-stage face processing 


SCIENCE science.org 


(N170 latency) for patient 
stratification and prognosis 
in subjects with ASD and in 
age-matched healthy individu- 
als. N170 latency was slower in 
individuals with ASD and was 
correlated with response to 
faces measured with functional 
magnetic resonance imaging 
and with polygenic risk score. 
Among subjects with ASD, the 
N170 values stratified patients 
according to socialization 
prognosis and improved power 
in a simulated clinical trial. The 
results suggest that including 
N170 evaluation in patient strati- 
fication might help in the design 
and development of patient-spe- 
cific therapies for ASD. -MM 
Sci. Transl. Med. 14, eabf8987 (2022). 


CELL BIOLOGY 
Phosphorylated XPD 
regulates mitosis 


Xeroderma pigmentosum 
group D protein (XPD) is a DNA 
helicase involved in DNA repair 
and transcription. XPD muta- 
tions cause a variety of human 
recessive disorders, including 
neurological and skin abnormali- 
ties, elevated cancer rates, and 
intellectual and growth deficits. 
Compe et al. show that XPD 
phosphorylated by the CDK1 
mitotic kinase interacts with 
Eg5, a motor protein required to 
establish a mitotic spindle. Cells 
from XP-D patients with muta- 
tions that block the XPD-Eg5 
interaction display defective 
mitosis and chromosome seg- 
regation. The authors conclude 
that, in addition to changes in 
DNA repair and transcription, 
mitotic defects contribute to 
XP-D patient phenotypes. —DLD 
Sci. Adv. 10.1126/ 
sciadv.abp9457 (2022). 
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NEUROGENETICS 


Functional regulatory variants implicate distinct 
transcriptional networks in dementia 


Yonatan A. Cooper, Noam Teyssier, Nina M. Drager, Qiuyu Guo, Jessica E. Davis, Sydney M. Sattler, 
Zhongan Yang, Abdulsamie Patel, Sarah Wu, Sriram Kosuri, Giovanni Coppola, 


Martin Kampmann, Daniel H. Geschwind* 


INTRODUCTION: The widespread adoption of 
genome-wide association studies (GWASs) 
has revolutionized the detection of genetic 
loci associated with complex traits. How- 
ever, identification of the causal variants and 
mechanisms underlying genotype-phenotype 
associations poses an enormous challenge 
because most common susceptibility loci 
reside in noncoding genomic regions and are 
composed of many correlated polymorphisms 
owing to linkage disequilibrium (LD). Mas- 
sively parallel reporter assays (MPRAs) permit 
the high-throughput functional character- 
ization of noncoding genetic variation, yet 
they have not been systematically applied to 
neurodegenerative disease. In this study, we 
used MPRA coupled with CRISPR-based vali- 


Regulatory variant identification 
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dation to identify likely causal genetic variants 
underlying two neurodegenerative conditions 
that are neuropathologically linked by intra- 
cellular tau protein aggregation—Alzheimer’s 
disease (AD) and progressive supranuclear 
palsy (PSP). 


RATIONALE: Neurodegenerative dementias such 
as AD and PSP are a major cause of morbidity 
and mortality worldwide. There are no disease- 
modifying therapeutics for either disorder, 
motivating major efforts to characterize dis- 
ease pathogenesis. Identifying causal genetic 
factors and downstream associated risk genes 
are fundamental initial steps in developing a 
mechanistic understanding that would enable 
therapeutic development. 


Biological insight 


Systematic characterization of disease loci 
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Disease-specific regulatory networks 
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Massively parallel functional genomics identifies genes and mechanisms in dementia. We used MPRA 
to screen thousands of common genetic variants associated with either AD or PSP to identify 300 likely 
functional regulatory variants. When coupled with CRISPR editing, this approach permitted the dissection 
of multiple risk loci and identification of several risk genes. The ability to view the aggregate activity of 
genetic risk shows convergence on disease- and cell type-specific transcriptional networks. TF, transcription 


factor; PPI, protein-protein interaction. 
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RESULTS: We tested the transcriptional reg- 
ulatory activity of 5706 noncoding single- 
nucleotide variants, representing 25 genome-wide 
significant loci associated with AD and nine loci 
associated with PSP, by MPRA, using the 
neuroepithelial-like human embryonic kid- 
ney 293T (HEK293T) cell line. We identified 
320 different functional regulatory variants 
(frVars) that affect gene expression within 
27 of these loci. AD frVars were enriched with- 
in microglial enhancers, whereas PSP frVars 
were enriched within neuronal enhancers 
and, to a lesser extent, oligodendrocytes, 
consistent with differential cellular impact 
in each disease. 

The majority of frVars (94%) overlapped 
two or more known functional annotations in 
human brain tissue or blood, nearly two-thirds 
of which were predicted to disrupt transcrip- 
tion factor binding, indicating their potential 
relevance in human disease. Forty-two high- 
confidence regulatory variants distributed 
across 15 AD loci and three PSP loci were 
selected for validation, using either CRISPR 
droplet sequencing (CROP-seq) or direct 
CRISPR excision in induced pluripotent stem 
cell-derived neurons, microglia, and astrocytes, 
enabling validation of 19 functional variants, 
implicating 20 risk genes across 11 loci. 

Our data provide evidence implicating C4A, 
PVRL2, and APOC] as risk genes in AD and 
PLEKHM1 and KANSL1 as risk genes in PSP, 
as well as additional validation for more than 
a dozen other genes. MPRA-defined functional 
variants preferentially disrupt predicted tran- 
scription factor binding sites (TFBSs) that 
converge on enhancers with differential cell 
type-specific activity in PSP and AD, implicat- 
ing a neuronal SP/-driven regulatory network 
in PSP pathogenesis. 


CONCLUSION: We provide systematic charac- 
terization of common variants underlying dis- 
ease risk for two distinct neurodegenerative 
disorders, AD and PSP. Our work illus- 
trates the utility of integrating multiplexed 
reporter and CRISPR assays to efficiently 
characterize noncoding disease-associated 
variation, thereby permitting the identifi- 
cation of risk genes, even in complex loci 
with high LD. These analyses support a 
mechanism underlying noncoding genetic 
risk, whereby common genetic variants drive 
disease risk in aggregate through polygenic 
cell type-specific regulatory effects on spe- 
cific gene networks. 


The list of author affiliations is available in the full article online. 
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NEUROGENETICS 


Functional regulatory variants implicate distinct 
transcriptional networks in dementia 


Yonatan A. Cooper”, Noam Teyssier*, Nina M. Drager’, Qiuyu Guo*, Jessica E. Davis®, 
Sydney M. Sattler*, Zhongan Yang°, Abdulsamie Patel®, Sarah Wu®, Sriram Kosuri°, 
Giovanni Coppola*®, Martin Kampmann *’, Daniel H. Geschwind?®910* 


Predicting the function of noncoding variation is a major challenge in modern genetics. In this study, 
we used massively parallel reporter assays to screen 5706 variants identified from genome-wide 
association studies for both Alzheimer’s disease (AD) and progressive supranuclear palsy (PSP), 
identifying 320 functional regulatory variants (frVars) across 27 loci, including the complex 

17q21.31 region. We identified and validated multiple risk loci using CRISPR interference or excision, 
including complement 4 (C4A) and APOC1 in AD and PLEKHM1 and KANSL1 in PSP. Functional variants 
disrupt transcription factor binding sites converging on enhancers with cell type-specific activity in 
PSP and AD, implicating a neuronal SP1-driven regulatory network in PSP pathogenesis. These analyses 
suggest that noncoding genetic risk is driven by common genetic variants through their aggregate 


activity on specific transcriptional programs. 


eurodegenerative disorders such as Alz- 

heimer’s disease (AD) and progressive 

supranuclear palsy (PSP) are a major 

cause of morbidity and mortality world- 

wide (/). Given the paucity of disease- 
modifying therapeutics, there is a considerable 
need for investigation of causal disease mech- 
anisms. Sporadic AD and PSP, both known as 
tauopathies because of the pathological depo- 
sition of tau in the brains of affected indi- 
viduals (2), are complex polygenic traits with 
heritability estimates of between 40 and 80% 
(3, 4) and numerous susceptibility loci iden- 
tified by genome-wide association studies 
(GWASs) (5-8). Most of these loci overlap non- 
coding regions and encompass hundreds of 
variants that are difficult to study owing to 
linkage disequilibrium (LD), which hampers the 
identification of underlying regulatory variants 
and associated risk genes (9). Salient examples 
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include regions harboring multiple indepen- 
dent association signals, such as the H1 pan- 
neurodegenerative risk haplotype at 17q21.31 
that includes the MAPT locus (J0) and the 
AD risk locus at 19q13.32 that harbors APOE 
(11, 12). 

Computational methods alone have lim- 
ited predictive utility, especially in noncod- 
ing regions or regions of high LD (13-15). It 
is increasingly recognized that functional 
methods are necessary to identify true causal 
variants within most loci, but the sheer num- 
bers of variants challenge most experimental 
approaches. Massively parallel reporter assays 
(MPRAs) provide a solution, enabling high- 
throughput experimental characterization 
of the transcriptional-regulatory potential of 
noncoding DNA elements. In an MPRA, var- 
ious regulatory elements are cloned into an 
expression vector harboring a reporter gene 
and a unique DNA barcode to create an ex- 
pression library that is assayed by means of 
high-throughput sequencing (6, 77). MPRAs 
have prioritized functional common variation 
for multiple traits and disorders (78, 19) but 
have yet to be systematically applied to neu- 
rodegenerative disease. 


Results 
MPRA to identify candidate regulatory 
GWAS variants 


We conducted a staged analysis to identify 
regulatory variants underlying GWAS loci 
for two neurodegenerative tauopathies— 
Alzheimer’s disease and progressive supra- 
nuclear palsy (Fig. 1A). First, we performed 
two MPRAs to screen both alleles of 5706 non- 
coding variants in linkage disequilibrium 
[squared correlation coefficient (R®) > 0.8] with 
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all genome-wide significant polymorphisms 
from two recent AD GWASs (5, 6) and a PSP 
GWAS (8). In the first MPRA (MPRA 1), we 
tested 5223 variants encompassing 14 AD 
and five PSP GWAS loci. In the second MPRA 
(MPRA 2), we replicated 326 variants from 
MPRA 1 to determine assay reproducibility 
and screened another 483 variants encom- 
passing 11 additional AD loci and four newly 
identified suggestive loci for PSP (Fig. 1, B and 
C; fig. S1; and Table 1) (20). 

The performance of MPRAs to detect allelic 
skew depends on high library transfection effi- 
ciency, favoring the use of easy-to-transfect 
immortalized cell lines, such as human em- 
bryonic kidney 293T (HEK293T) cells (27). 
However, we recognized that gene regulatory 
elements may differ between primary and 
immortalized cell lines, as well as between dif- 
ferent primary cell types. Indeed, AD-associated 
variants fall within open chromatin across 
several different neuronal and glial cell types, 
which are largely nonoverlapping (6). We ex- 
plored this further using ENCODE deoxy- 
ribonuclease (DNAse) hypersensitive site (DHS) 
data (22), confirming poor DNA accessibility 
overlap between different brain cell types, such 
as astrocytes and neural progenitors (mean 
Jaccard index = 0.14). Notably, when we com- 
pared this overlap to that of the neuroepithelial- 
derived HEK293T cell line, we found that 
HEK293T cells had the highest mean pair- 
wise Jaccard index (0.22; Fig. 1D) compared 
with active regulatory regions identified across 
all brain tissues. Moreover, we found that AD 
and PSP GWAS variants that fell within open 
chromatin in any brain cell type were also likely 
to fall within open chromatin in HEK293T cells 
(mean = 60%; Fig. 1E). These data indicated 
that HEK293T cells would provide a technically 
tractable model for efficient screening within a 
single line. 

We performed both MPRAs in six indepen- 
dent technical replicates, obtaining activity 
measurements from at least five unique bar- 
codes for both alleles of 5340 of 5706 different 
variants (93.6%; data S1) with median library 
complexities of 40 and 54 barcodes per allele, 
respectively (fig. S2). Consistency within (fig. 
S2) and between the separate MPRA experi- 
ments was high, highlighting the MPRA’s 
reproducibility [activity scores, Pearson’s 
correlation coefficient (r) = 0.98, P< 2 x 107"; 
effect sizes, 7 = 0.94, P< 2 x 10°"; Fig. 2A]. 
Overall, we observed that ~19% of library ele- 
ments were transcriptionally active (fig. S3A), 
in concordance with previous estimates (18, 19). 
Relative to nonactive elements, transcription- 
ally active elements were enriched within ge- 
nomic annotations (HEK293T cells), including 
DHS (open and active chromatin), H3K27Ac 
(active enhancers and promoters), H3K4Me3 
(active promotors), and chromatin immuno- 
precipitation sequencing (ChIP-seq) peaks 
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Fig. 1. Project overview. 
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GWAS variant within open chromatin in a given cell type also overlaps open chromatin in another cell type. Green arrows highlight HEK293T. Error bars, SEM. 


for specific transcription factors (fig. S3B), 
consistent with their likely functionality. We 
also identified significant enrichment of tran- 
scription factor binding sites (TFBSs) within 
active elements, including SP/KLF, ETS, and 
AP-1 family members, and did not find major 
differences in TFBS enrichment when stratify- 
ing our results by disorder (fig. S3C). 
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We applied a stringent statistical thresh- 
old to define functional regulatory variants 
(frVars)—those with significantly different 
transcriptional efficacy between alleles [two- 
sided Mann-Whitney U test, false discovery 
rate (FDR) q < 0.01, Benjamini-Hochberg (BH)], 
identifying 320 different frVars distributed 


across 17 chromosomes (6.0%; Fig. 2B) within 
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27 of 34 tested GWAS loci, with a median of 
two frVars per locus (Fig. 2C and Table 1). 
As expected, effect sizes were generally mod- 
est (mean absolute log, fold change = 0.53; 
Fig. 2D). FrVars were highly enriched within 
library elements that were transcriptionally 
active [odds ratio (OR) = 6.2; P < 2 x 107" 
(20) and within open chromatin from major 
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See 
Table 1. AD and PSP GWAS loci. Description of GWAS loci and variants tested in this study. Shown are the locus lead SNPs and annotated genes, as 
described in the listed GWASs. For each locus, the table shows: the MPRA stage in which it was tested, the number of variants tested per locus, and the 
number of functional regulatory variants with significant allelic skew (FDR q < 0.01; frVars) ultimately identified. Chr, chromosome; Pos, position. 


Chr Pos/cytoband 
1 85603051 


Lead SNP 
rs114573015 


Annotated gene GWAS" 
WDR63 PSP 


MPRA stage No.of SNPstested  frVars 


rs6656401 CRI AD 


Al 28156856 


rs2830500 


PTK2B AD 


MS4A6A AD 


SORL1 AD 


FERMT2 


ADAMI10 


ADAMTS1 


*Source GWAS for tested loci and variants: PSP, (8); AD 1, (5); AD 2, (6). {Locus remapped in AD GWAS 2. Lead SNPs are not in LD. Both lead SNPs were tested here. 


brain cell types, as well as monocytes, which 
derive from the same lineage as microglia 
(Fig. 2E). However, when we separated frVars 
derived from AD versus PSP GWAS loci and 
identified those that were found to overlap 
enhancer marks from recent work in purified 
nuclei from postmortem human brain (23), we 
observed that the cell types affected by each 
disorder were distinct. A plurality of PSP 
frVars fell within neuronal enhancers, which 
was significantly above chance (P = 8.3 x 10°; 
x? test of equal proportion) (20). In contrast, 
a plurality of AD frVars fell within microglial 
enhancers (P = 0.08), in concordance with re- 
cent estimates of cell type-specific enrichment 
in single-nucleotide polymorphism (SNP)- 
based heritability for these two disorders (24) 
(Fig. 2F). Across both disorders, 55/78 (71%) 
of these variants overlapped with cell type- 
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specific enhancer annotations identified in 
postmortem human brain (Fig. 2G), further 
supporting the correspondence of these findings 
with regulatory elements active in human brain 
tissue. Of note, we found functional predictions 
from four popular computational algorithms to 
have poor concordance with regulatory anno- 
tations identified in this study and another 
published MPRA dataset (fig. S4) (78, 20). 


Refinement of frVar annotations for high- 
confidence prediction of regulatory variants 


We next sought to determine the extent to 
which empirical predictions derived from 
HEK293T cells translate to the regulatory land- 
scape of the human brain. We characterized 
frVars using functional annotations from post- 
mortem human brain tissue (23, 25, 26), brain 
and microglial- or monocyte-specific expression 
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quantitative trait loci (e@QTLs) (27-29), predicted 
disruption of TFBSs (30, 31), and brain-specific 
chromatin interaction data (28) (data S2). 
Blood eQTLs (27) were also assessed as a more 
powered approximation of microglia eQTLs, for 
which study sample sizes are relatively small. 
Of 320 frVars, all except one (319/320, 99.7%) 
had at least one functional regulatory anno- 
tation in brain, and the majority (301, 94%) 
overlapped two or more functional annota- 
tions (Fig. 2, H and I, and data S2): 270 (84%) 
were brain, microglial, or blood eQTLs; 238 
(74%) overlapped transcriptionally active re- 
gions in human brain; 88 (28%) fell within a 
promoter or enhancer in at least one brain 
cell type; and 200 (63%) significantly (all P < 
0.05) (20) altered transcription factor binding. 
This indicates that most of the regulatory var- 
iants identified by our assay likely have an 
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Fig. 2. Identification of A 
MPRA functional regula- 
tory variants. (A) Reprodu- 


cibility of MPRA across 


experimental stages. A total 
of 326 variants, including 
186 frVars identified from 
the preliminary assay 
(MPRA 1), were retested in a 
follow-up assay (MPRA 2). 
(i) Reference allele transcrip- 
tional activity and (ii) logs 
effect sizes (alternative 
divided by reference allele) 
show strong correlation 
(Pearson's r = 0.98, 0.94; 
P<2x 10) between 
experiments. BC, barcode 
count. (iii) One hundred 
fifty-two of 186 assessed 
frVars from MPRA 1 

were replicated in MPRA 2 
(Bonferroni P < 0.05). 

(B) Manhattan plot of 

5340 different variants 
assayed across both 
experiments. Red indicates 
frVars at FDR-adjusted 

q < 0.01 (BH method). 

(C) Histogram of the number 
of frVars identified per 
GWAS locus LD block 
(median = 2). (D) Volcano 
plot shows logs allelic skew 
effect sizes and —logio 

P values for 5340 different 
variants tested by MPRA. 
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confidence interval (Cl); all 
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brain tissue (23). (F) Bar plot shows cell type-specific enhancer annotation counts for overlapping frVars separated by disease. A plurality of AD frVars fell within microglial 
enhancers, whereas PSP frVars fell within neuronal enhancers. (G) Most frVars overlap enhancers present in only one cell type. (H) Upset plot shows the number of frVars 
(bars) overlapping combinations (dots and lines below bars) of different functional genomic annotations. Marginal probability of each specific annotation shown in the graphic 
(lower left). (I) Pie chart depicts frVars binned by total number of overlapping genomic annotations (percentages and counts). 


active regulatory role in human brain tissue. 
We used these annotations to further prior- 
itize the highest-likelihood regulatory var- 
iants and nominate downstream cognate 
target genes. 
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Experimental validation of frVars 

On the basis of these functional annotations, 
we selected 42 high-likelihood regulatory var- 
iants for additional validation, choosing those 
that reside within regulatory regions active in 
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brain tissue [Table 2; selection rationale (20)]. 
These variants were distributed across 15 dis- 
tinct AD loci (APOE, BINI, CASS4, CELFI/SPI1, 
CLU, CR1, ECHDC3, EPHAI, FERMT2, HLA- 
DRBI/5, INPP5D, IQCK, MS4A, PTK2B, and 
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Table 2. CRISPR validation of 42 frVars. Summarized results for the 42 frVars validated using CRISPR-Cas9-mediated editing. MPRA frVars are listed with 
logs fold changes. Predicted disruption of transcription factor binding sites (TFBSx) using the union of two algorithms (30, 31) is shown. The method 

used to validate each variant is shown: (i) pooled CRISPRi screen (CROP-seq), (ii) CRISPR enhancer excision assay (CRISPR-ex), or (iii) “None” if the variant 
failed validation. FDR-adjusted (Benjamini-Hochberg method) q-values for CRISPR experiments shown. All CRISPR q < 0.05. 


MPRA Validation 
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SLC24A4/RIN3) and three distinct PSP loci 
(17q21.31, ASAPI, and RUNX2). We used two 
CRISPR-Cas9-mediated strategies to assay 
regulatory regions in their native genomic 
context within neurons, microglia, and astro- 
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0.27 lone 


cytes derived from human induced pluripotent 
stem cells (iPSCs). The first leveraged a pooled 
CRISPR interference (CRISPRi) platform in 
neurons (32) and microglia (33) using CRISPR 
droplet sequencing (CROP-seq) (34) to target 
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EPHA1_Microglia; 
TAS2R60_Microglia 


all 42 variants. Recognizing that the pooled 
screening method using previously untested 
guides would not effectively edit all of the 
targeted regions, we additionally performed 
direct enhancer excision on a subset (five) of 
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these 42 variants that were highly significant 
MPRA hits and whose potential regulatory 
role was supported by functional genomic 
data, such as eQTL and high-throughput 
chromosome conformation capture (Hi-C)— 
1813025717, rs636317, rs9271171, rs36026988, 
and rs10224310—assaying downstream ef- 
fects on gene expression using quantitative 
polymerase chain reaction. We identified 
the candidate target genes for a given var- 
jant using eQTL or chromatin interaction 
data based on Hi-C from the relevant neural 
tissue (Table 2). 

For CROP-seq, each variant was targeted 
using two different guide RNAs, which were 
introduced into iPSCs that were subsequently 
differentiated into neurons or microglia. We 
captured and sequenced 174,983 single neu- 
rons and 130,137 microglia corresponding to a 
median of 1030 distinct neurons and 739 mi- 
croglia per guide (fig. S5, A and B). Single-cell 
clustering and expression of marker genes 
confirmed effective differentiation of both 
cell types (fig. S5C). CROP-seq results for all 
surveyed variant-gene pairs for both cell types 
are summarized in fig. S6. In total, we were 
able to validate 19 variants that regulated at 
least one predicted gene in at least one target 
cell type (Table 2 and data S3), representing a 
validation efficiency of 45%. This validation 
rate was higher than the 11% enhancer vali- 
dation probability defined in a previous CROP- 
seq study (35). We verified the activity of frVars 
at 11 separate GWAS loci, identifying regulatory 
relationships with multiple previously uniden- 
tified risk genes outside of the MAPT and 
APOE loci, including MS4A4E, C4A, APOCI, 
and GPRC5B for AD and PLEKHM1 for PSP 
(Fig. 3, fig. S6, and Table 2). 

Several canonical risk genes, including CRI, 
CLU, and BIN1, have been consistently asso- 
ciated with AD risk across multiple GWASs 
(36), but the causal variants and relevant cell 
types underlying these associations remain 
to be fully clarified. We were able to identify 
and validate functional variants regulating 
each of these three established risk genes. At the 
Clusterin (CLU) locus, we validated intronic 
variant 1s1532277 as regulating CLU expression 
within neurons (Fig. 3, A and B). Although pre- 
vious studies proposed additional risk genes 
at this locus based solely on chromatin inter- 
action assays (23), we did not detect any ef- 
fects of rs1532277 on the expression of distal 
genes CCDC25 or CHRNA2 (Fig. 3B). 

At the complement receptor 1 locus (CRD, we 
validated the distal intronic variant rs6701713 
as regulating CRI expression specifically with- 
in microglia, with the disease-associated allele 
(A) predicted to disrupt binding of CTCF and 
increase expression of CRI [Genotype-Tissue 
Expression (GTEx)] (Fig. 3, C and D; Table 2; 
and data $2). Within the BIN7 locus, previous 
work had rigorously established rs6733839 
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as a causal variant acting specifically in mi- 
croglia (23). However, two other studies pro- 
posed rs13025717 as an additional independent 
risk variant at the locus on the basis of allele- 
specific differential chromatin accessibility 
and chromatin interaction data (37, 38). MPRA 
identified rs13025717 as a highly significant 
frVar (P = 2.6 x 10~*°; Fig. 3E), and we ex- 
perimentally confirmed that this variant reg- 
ulates BINI expression in microglia using 
CRISPR-mediated enhancer excision (Fig. 
3F). We also experimentally validated regu- 
latory variants at other canonical loci includ- 
ing rs10224310 and rs36026988, which we 
show regulate EPHA/ and RIN3 expression, 
respectively, in microglia (Fig. 3, G to J; fig. S6; 
and Table 2). 

We also identified variants with pleiotropic 
regulatory activity, allowing us to identify 
candidate risk genes underlying AD risk. The 
membrane-spanning 4-domains (MS4A) clus- 
ter on chromosome 11 contains more than 
16 genes and harbors numerous variants 
conferring risk for AD (37, 38). Previous work 
at this locus implicated rs636317 as a likely 
causal variant and nominated MS4A6A as 
the cognate risk gene (37, 38). In this study, 
we confirmed the regulatory function of 
18636317, as well as another variant, rs667897, 
within the MS4A locus (Fig. 3K). Notably, we 
found that both variants regulate expression 
of MS4A4E, but not MS4A4A, specifically 
within microglia, with rs667897 also sugges- 
tive for regulation of MS4A6A (Fig. 3, L to O). 
Additionally, using enhancer excision in 
THP-1-derived macrophages, we found that 
rs636317 also regulates MS4A6A expression 
(fig. S7A), although this was not significant 
in the CRISPRi screen (Fig. 3M). The disease- 
associated (T) allele of rs636317 disrupts bind- 
ing of transcriptional repressor CTCF, thereby 
increasing gene expression (38). Conversely, 
the disease-associated (A) allele of rs667897 
creates an NFE2-MAFG TFBS, increasing 
MS4A6A and MS4A4E expression in microglia 
to confer AD risk (Table 2). 

We next explored the HLA-DRBI1/5 AD 
GWAS locus, which is a highly polymorphic 
region of extended LD with numerous AD- 
associated variants and potential risk genes. 
MPRA identified 13 significant variants in the 
locus falling within the intergenic region be- 
tween HLA-DRBI and HLA-DQAI. We further 
validated rs9271171 using enhancer excision in 
iPSC-derived astrocytes, because brain chro- 
matin conformation data from PsychENCODE 
predicted that rs9271171 regulates the distal 
gene complement 4 (C4A), which is most highly 
expressed in astrocytes (39). Indeed, we ob- 
served that excision of this variant significantly 
reduced C4A expression in both astrocytes (P = 
0.04; Fig. 3, P and Q) and THP-1 macrophages 
(P = 0.003; fig. S7B), highlighting C4A as a risk 
gene for AD. Other examples of loci wherein we 
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identified variants regulating multiple down- 
stream target genes include JQCK (rs9935063, 
rs11646891; GPRC5B, IQCK, KNOPI) and SPI 
(rs10876450, rs12223593; SPI1, CELFI, FNBP4) 
(fig. S6 and Table 2). Whether all, or a subset, 
of the genes regulated at these loci contribute 
to disease risk will require individual func- 
tional validation in model systems. 


Characterization of 17q21.31 and 19q13.32 


The chromosome 17q21.31 locus is noteworthy 
for harboring the tau-encoding MAPT gene 
within a common 900-kb inversion poly- 
morphism and is a major risk locus for PSP 
(H1 haplotype OR = 4 to 5) as well as AD, 
Parkinson’s disease, and corticobasal degener- 
ation (10). 17q21.31 contains complex haplo- 
typic substructural variation and extensive 
LD, hampering interrogation with traditional 
statistical genetics approaches (13). We lever- 
aged the ability to functionally dissect this re- 
gion with MPRA, testing 3482 variants within 
17q21.31 in strong LD with lead SNPs from a 
PSP GWAS (8), which comprise ~24% of the 
more than 14,000 common variants in the re- 
gion, identifying a total of 194 frVars, or 5.6% 
of those within this region. 

We next clustered these frVars according to 
LD (minimum R? = 0.5), identifying seven dis- 
tinct LD clusters (Fig. 4A), which suggests 
multiple distinct loci within this region. The 
largest LD cluster includes 42 frVars within 
the core MAPT gene itself [-5 kb upstream of 
the transcriptional start site (TSS) to the 3’ 
untranslated region (3'UTR], of which 13 are 
variants within annotated enhancers predicted 
to disrupt TFBSs (Fig. 4B) and are in strong LD 
(R® > 0.9) with the H1 marker rs8070723 (data 
S4). We also highlight a cluster of eight con- 
tiguous frVars overlapping a large oligoden- 
drocyte enhancer within a distal regulatory 
element of the MAPT first intron (Fig. 4B). 
These variants were of interest because tau 
pathology within oligodendrocytes is a de- 
fining pathological feature of PSP (40). All 
eight of these regulatory variants are in low 
LD (D' = 1, R? = 0.18) with rs24.2557, a marker 
SNP for the H1IC subhaplotype known to con- 
fer risk for PSP (70) (data S4 shows pairwise 
LD for rs242557). 

A core promoter beginning —300 bp from the 
TSS (41 has been previously characterized in 
a variety of cell types (41, 42). As basal MAPT 
expression (in addition to splicing) is thought 
to be a key molecular driver of tauopathy 
pathogenesis (8), we used the functional data 
from MPRA to further clarify the cis-regulatory 
landscape of the MAPT gene. The MPRA ele- 
ment corresponding to positions —226 to —63 of 
the MAPT gene was highly transcriptionally 
active, whereas elements within the region 
-349 to -186 drove only modest expression 
(data S1), suggesting a core promoter starting 
—226 bp upstream of the MAPT TSS (Fig. 4C). 


6 of 13 


RESEARCH | RESEARCH ARTICLE 


Fig. 3. CRISPR validation A 
of eight MPRA frVars 
across seven AD loci. 

(A and B) rs1532277 in the 
CLU locus. (A) Violin plot 
shows MPRA normalized 
barcode (BC) distributions 
for each allele. FDR-adjusted 
q-values displayed. (B) 
CRISPRi targeting suppresses 
CLU (z-score = -3.8, 

q = 2.3 x 10°) but not 
CCDC25 or CHRNA2 expres- 
sion specifically in iPSC- 
derived neurons. Error bars, 
95% Cl. FC, fold change. 

(C and D) rs6701713 in the 
CRI locus. (C) MPRA 

data as in (A). (D) CRISPRi 
targeting suppresses CRI 
expression (z-score = -2.8, 
q = 0.04) specifically in 
microglia. Error bars, 95% Cl. 
(E and F) rs13025717 in 

the BINI locus. (E) MPRA 
data. (F) CRISPR-mediated 
excision of genomic region 
containing rs13025717 K 
in microglia reduces BINI 
expression compared 

with gRNA-scramble controls 
[control n = 9, CRISPR 

n =12, tg = 5.8; P < 0.0001). 
Error bars, SEM. (@ and 
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(K) MS4A locus variants, highlighting rs667897 and rs636317. (L) rs636317 MPRA data. (M) CRISPRi targeting of rs636317 reduces MS4A4E (z-score = -3.0, 

q = 0.02) expression. (N) rs667897 MPRA data. (O) CRISPRi targeting of rs667897 significantly reduces MS4A4E (z-score = -3.7, q = 0.003) and shows a trend 
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We next sought to identify functional variants 
likely to control MAPT gene expression. We 
chose three frVars for further experimental 
validation prioritized by regulatory annota- 
tions in brain and chromatin looping to the 


MAPT TSS (data S2): rs76594.404, within the 
proximal promoter (predicted to disrupt 
EGRI/USF binding); rs76324150, which falls 
within the proximal first MAPT intron (dis- 
rupts ZFX); and rs80346216, within the distal 
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first intron (Fig. 4C). These loci were con- 
firmed using CRISPRi to regulate MAPT ex- 
pression within iPSC-derived neurons (Fig. 4, 
D and E; fig. S6; and Table 2). Additionally, we 
identified a frVar just distal to the MAPT gene 
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Fig. 4. Systematic characterization of 17q21.31. (A) (Top) 17q21.31 LD plot 
(1000 Genomes, CEU) above MPRA frVars clustered by LD. (Bottom) 
frVars plotted by position and MPRA significance (-logi9 P values; colors 
indicate clusters, black indicates unclustered). Most variants fall within LD 

cluster 4 (purple) centering on MAPT. (B) Chromatin annotations and frVars 
across MAPT. Tracks (1 through 8) show H3kK27ac (active) and H3K4me3 

(promoter) ChIP-seq peaks for microglia (red), neurons (green), astrocytes 
(blue), and oligodendrocytes (orange) from (23); CpG islands (track 9); 
and tested variants plotted by —logj9 P value (track 10). FrVars in red. MAPT 
gene model (track 11). Exons: blue, untranslated; white, constitutive; green, 
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alternative neuronal; and yellow, rarely expressed in brain. Blue square shows 
frVars within an oligodendrocyte enhancer. (C) Annotation of the MAPT 
promoter [-4364 to +3292 from TSS, features from (42)] with frVars (red dots). 
Red, repressor regions. (Bottom) MPRA relative reporter activity for two 
elements. (D and E) Three frVars regulating MAPT expression. (D) MPRA 
violin plots show normalized allelic barcode distributions. (E) CRISPRi 
targeting suppresses MAPT expression for rs76594404 (z-score = -9.1, 

q = 9.7 x 1078), rs76324150 (z-score = -4.0, g = 1.5 x 10°), and rs80346216 
(z-score = -2.8, q = 0.04) in iPSC-neurons. (F to I) rs111392251: frVar in 
PLEKHM1 promoter (+3 kb). (F) Tracks (1 through 4) show H3K4me3 peaks 
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[as in (B)]. (G) The alternate allele of rs111392251 disrupts /RF1-3 TFBSs. 


(H) MPRA data as in (D). (I) CRISPRi targeting of rs 


PLEKHMI1 expression (z-score = —3.1, q = 0.01) in neurons but not microglia. 
Error bars, 95% Cl. (J) Variants within the APOE locus plotted by coordinate 


body, rs2532404, that we confirmed strongly 
regulates KANSLI expression in neurons, but 
not microglia (fig. S6). 

We also identified three separate LD clus- 
ters at 17q21.31 harboring variants predicted 
to regulate genes besides MAPT. Putative risk 
genes overlapping regulatory variants in inde- 
pendent LD blocks implicated here include 
MAP3KI4, PLEKHM1, and LRRC37A4P (table 
S1). One such cluster harbors rs111392251, a 
high-confidence regulatory variant located in 
the promoter of PLEKHM1 that is predicted to 
disrupt binding of JRF-family TFs (Fig. 4, F 
to H). We used CRISPRi to validate that this 
locus regulates PLEKHM1] expression within 
neurons but not microglia (Fig. 4D. PLEKHM1 
regulates autophagosome-lysosome formation 
(43) and has been previously suggested as a 
PSP risk gene on the basis of proximity (7), but 
has yet to be functionally characterized in a 
disease context. These results indicate that 
PLEKHM1 dysregulation within human neu- 
rons contributes to PSP risk, independent of 
MAPT expression. 

The APOE locus on 19q13.32 harbors the 
strongest common genetic association with 
late-onset AD, tagging the well-characterized 
APOE4 risk haplotype (5, 6). However, the 
extensive LD in the region has resulted in iden- 
tification of hundreds of additional disease- 
associated variants (11, 12). Recent work has 
uncovered evidence for APOE- independent 
risk for AD in the locus, implicating PVRL2/ 
NECTIN2 and APOCI (11), although some have 
argued that APOE coding variants mediate 
the entire association signal (72). We reasoned 
that identification of functional variants at 
19q13.32 would inform our understanding of 
this complex regulatory architecture. We tested 
640 variants in LD with the 538 genome-wide 
significant SNPs (5) at this locus and iden- 
tified 37 frVars (5.8%; Fig. 4J). Through in- 
tegration with whole brain and blood eQTLs, 
we show that 10 of these were predicted to 
regulate PVRL2 expression, identifying PVRL2 
as a strong candidate risk gene at this locus 
(table $2). With respect to APOCI, we exam- 
ined a distal segment of open chromatin 
within an intergenic region downstream from 
the APOC2-APOC4 gene carrying two frVars, 
1s10423208 and rs117316645 (Fig. 4J), neither 
of which are in LD with the APOE haplotype 
tag SNPs rs7412 and 1s429358 (all R® < 0.01, 
EUR ancestry). CRISPRi targeting of both loci 
robustly down-regulated APOCI expression 
specifically within neurons (Fig. 4, K and L, 
and Table 2). Our results provide support for 
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11392251 suppresses 


APOC! as an AD risk gene, warranting its fur- 
ther study. 


MPRA frVars enrich for TFBS disruption 


We next hypothesized that functional variants 
defined by our frVars distributed across the 
genome would be enriched for variants that 
disrupt TF binding as a class, which was what 
we observed [enrichment OR = 1.4, P = 0.003; 
SNP2TFBS (30)]. We observed a similar magni- 
tude of enrichment of TFBS-disrupting variants 
among frVars in another published dataset (78) 
(OR = 1.9, P = 8.7 x 10°). Both enrichments 
were replicated using an alternative TFBS 
scoring method [motifbreakR (37)]. Further- 
more, the magnitude and directionality of pre- 
dicted TFBS disruption correlated with MPRA 
effect sizes for frVars in both our dataset 
(Spearman’s rho = 0.44, P = 2.3 x 10 *) and the 
other dataset (18) (rho = 0.52, P < 8.7 x 10°°; 
Fig. 5, A and B). In general, we did not find 
that TFBS disruption alone predicts MPRA 
allelic skew (positive predictive value = 0.14) 
(20), although disruption of a subset of specific 
TFs such as ELK4, ETSI, GABPA, and SP- 
family members did modestly predict allelic 
skew captured by MPRA in these data (fig. S8). 


Enrichment of functional risk variants within 
disease-specific transcriptional networks 


We next assessed whether frVars were enriched 
within binding sites for specific transcription 
factors, and we found that TFBSs disrupted by 
risk variants differed by disease. In AD, CTCF 
(logy OR = 4.5, P = 3 x 10°”), NR4A2 (logs 
OR = 4.9, P = 0.002), NR5A2 (logs OR = 3.6, P= 
0.01), ATOH] (logs OR = 3.8, P = 0.009), SP2 
dogs OR = 2.3, P = 0.008), and SMAD-family 
(SMAD2,3,4 heterotrimer, log, OR = 3.7, P = 
0.0002) binding sites were enriched for dis- 
rupting risk variants, which is consistent with 
previous work identifying enrichment of AD 
risk variation in microglial enhancers con- 
taining CTCF, SMAD, and SP binding motifs 
(38). In contrast, PSP showed a different pat- 
tern of TFBS enrichment, in which five of the 
six TFs predicted to be enriched for binding 
site disruption physically interact with the 
transcription factor SP1 (including SP1 itself; 
OR= 1.8, P = 0.007; Fig. 5C). Of note, the set of 
PSP-enriched TFs forms a significant protein- 
protein interaction (PPI) network with SP1 
(STRING P = 1.7 x 10°; permutation P = 0.05; 
Fig. 5C) (20). This finding is consistent with 
SP1’s multimerization capabilities and its ac- 
tivity as a core component of a broad array 


of gene regulatory complexes that regulate 
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and MPRA -logio P value. FrVars in red, blue dashes loop to APOC1 promoter. 
(K) MPRA data and (L) CRISPRi targeting of rsl10423208 (z-score = -5.5, 

q = 1.5 x 10~) and rs117316645 (z-score = -2.8, q = 0.04) suppresses APOCI 
expression in neurons but not microglia. Error bars, 95% Cl. 


tissue-specific gene expression (44). Notably, 
11.1% of annotated PSP frVars are predicted 
to participate specifically within this network. 
We also assessed enrichments at a lower sta- 
tistical threshold (TFBS enrichment g < 0.1), 
finding additional ETS-family TFs and SP1 
interactors to be enriched within functional 
PSP risk variants (fig. S9). Although these TFs 
are highly expressed in HEK293T cells, we did 
not find a general relationship between rela- 
tive TF abundance and MPRA allelic skew (P > 
0.05; Spearman’s correlation) (20), indicat- 
ing that expression levels do not explain our 
results. Additionally, we did not identify a sig- 
nificant enrichment for disrupted TFBS motifs 
among equivalently sized random collections 
of GWAS variants (20), demonstrating that 
our results are not explained by the initial SNP 
selection. 

Given these findings, we sought to verify 
whether our computational predictions of 
TFBS disruption were robust by performing 
electrophoretic mobility shift assays (EMSAs). 
We focused on SPI because it was the core hub 
gene in the PPI network, testing the relative 
SP1 binding affinity for each of the 14 frVars 
predicted to disrupt SP1. binding. We iden- 
tified significant differential allele-specific 
binding affinities in 10 of 14 variants (g < 0.1; 
Fig. 5D and fig. S10) and found a strong cor- 
relation between EMSA results and MPRA 
effect sizes (Fig. 5E), confirming the bioinfor- 
matic regulatory predictions of SP1 activity. 

We next reasoned that we could leverage 
expression data from postmortem human 
brain tissue to garner additional evidence for 
dysregulation of SP1 network TFs in disease. 
Indeed, gene expression analysis confirmed 
that a majority of these network TFs, includ- 
ing SPI, were differentially expressed within 
the temporal cortex of individuals with PSP 
and matched controls (Fig. 5F and fig. S11A). 
Of note, the coding region for SP1 itself falls 
within a suggestive PSP risk locus that did not 
reach genome-wide significance (locus com- 
bined P = 4.1 x 107”) (8). Taken together, these 
data provide converging evidence for broad 
dysregulation of an SP1-centered transcrip- 
tional network in PSP pathophysiology. 

To explore this further, we generated a 
two-layer directed network composed of these 
six PSP TFs and their likely targets, defined as 
genes within the cis-regulatory window of 
TFBS-disrupting frVars, as well as genes linked 
by brain Hi-C (Fig. 5G; fig. S11B for AD net- 
work) (20). These TFBS-disrupting variants 
were then annotated using cell type-specific 
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Fig. 5. FrVars disrupt 
disease-specific TF 
networks. (A and 

B) MPRA variant effect 
sizes correlate with 
predicted TFBS 
disruption scores for 
our dataset (A) and a 
published dataset 

(18) (B). Red line, OLS 
regression, Spearman's 
rho. (C) Shows the 
logs enrichments for 
significantly disrupted 
TFBSs (FDR q < 0.05, 
BH method), with frVars 
from AD and PSP 
analyzed separately 
(color indicates —logio 
enrichment P values, 
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disease risk for two neurodegenerative dis- 
orders, AD and PSP, identifying regulatory 
variants at 27 of 34 tested genomic loci. We 
use functional genomic annotations from hu- 
man brain tissue to prioritize these variants, 
experimentally validating the activity of 19 var- 
iants and their cognate target genes using 
direct genome manipulation. Our work illus- 
trates the utility of integrating multiplexed 
reporter and CRISPR assays to efficiently char- 
acterize noncoding disease-associated varia- 
tion, suggesting a framework for future work 
in other complex traits. In agreement with 
previous studies (/4), we found that four exist- 
ing computational prediction algorithms for 
a priori variant prioritization were not highly 
concordant among themselves, nor were they 
predictive of functional variants identified 
by our MPRA or in a previously published 
dataset (18) [fig. S4; see (20)]. This empha- 
sizes the necessity for further development 
and utilization of high-throughput experimen- 
tal methods to prioritize noncoding regulatory 
variants from GWAS, especially in regions 
with extensive LD. In this study, we used 
MPRA and CRISPR to identify regulatory var- 
iants within two such regions, 17q21.31 and 
19q13.32, risk factors for PSP and AD, respec- 
tively, enabling identification of risk genes 
within these complex loci, including PLEKHM1 
and APOCI. 

A potential limitation of this study was the 
technical challenge of performing MPRA with- 
in human brain cell types, with the specific 
trans-regulatory environment of HEK293T 
cells likely influencing the generalizability 
of our screen (79). Recognizing that HEK293T 
cells are not primary neural cells, we directly 
addressed the question of cell type specificity 
through integration with orthogonal genomics 
annotations from the human brain (which 
reassuringly showed high overlap), followed 
by CRISPR validation of 42 variants within 
their native genomic context in neurons and 
microglia. In doing so, we provide strong evi- 
dence for multiple risk genes, implicating 
PLEKHM1] and KANSL1 in PSP and MS4A4E, 
APOC, and C4A in AD. Of note, our analysis did 
not address regulatory function within neuro- 
vasculature and epithelial cells, which may also 
contribute to AD pathogenesis (46). Additionally, 
some associated loci likely operate by means of 
mechanisms independent of direct transcrip- 
tional regulation (i.e., splicing, posttranslational 
modification), requiring other approaches to 
delineate causal variants and their mecha- 
nisms. However, this work takes an important 
step forward by providing large-scale func- 
tional annotation of these dementia risk loci. 

SNP-based heritability is known to enrich 
in regulatory regions within disease-relevant 
tissues (47). For AD, this enrichment affects 
networks of genes downstream of several TFs, 
including CTCF and SMAD, which have been 
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described as acting in multiple brain and pe- 
ripheral cell types to affect risk for AD (38, 48). 
For PSP, our analysis identified an enrichment 
of TFBS-disrupting frVars in a PPI network 
with the transcription factor SP1, which reg- 
ulates a broad array of cellular processes, in- 
cluding chromatin remodeling, apoptosis, 
immune regulation, and response to oxidative 
stress in neurons (49). Although SP1 network 
dysregulation has been identified in the AD 
brain (50), it does not appear to harbor AD 
genetic risk and therefore is likely to play a 
reactive or secondary role in AD. Overall, our 
data are consistent with PSP risk primarily 
affecting neurons and, to a lesser extent, oligo- 
dendrocytes (24, 45). This includes an oligo- 
dendrocyte enhancer within the tau locus, 
which we speculate may be related to the dis- 
tinctive glial pathology in PSP. 

These observations suggest a refined model 
for understanding the combinatorial effects of 
common genetic risk. Signals from intergenic 
GWAS loci are typically interpreted as deriving 
from causal regulatory variants that influence 
downstream expression of specific cognate 
risk genes. Our results, implicating a TF net- 
work converging on SP1 in PSP, are consistent 
with a model whereby common genetic var- 
iants function in aggregate across multiple 
TFBSs to disrupt key cell type-specific tran- 
scriptional programs. We speculate that this 
genetic mechanism may manifest itself only 
upon induction of the relevant transcriptional 
network, which may occur specifically within 
a disease context. Our data show that disease- 
relevant transcriptional networks regulate a 
large number of cell type-enriched genes, 
providing a mechanism whereby genetic risk 
is expressed not by affecting a few core genes 
(51) but by means of polygenic cell type-specific 
regulatory effects on networks of genes (52). 


Materials and methods summary 


A full description of the materials and methods 
is provided in the supplementary materials 
(20). An abbreviated version is provided below. 


Massively parallel reporter assay 


Variant selection and MPRA were performed 
as described (20). To identify functional regu- 
latory variants with significant allelic skew 
(frVars), normalized barcode counts were com- 
bined across replicates and significance deter- 
mined using a two-way Mann-Whitney U test 
comparing barcode counts between each allele. 
FrVars were defined at an FDR threshold qg < 
0.01 (Benjamini-Hochberg method). MPRA log» 
effect sizes were defined as the median summed 
normalized barcode count for the alternate 
allele minus the reference allele. 


Genome engineering 


For CRISPR excision experiments, a single 
line from a healthy male donor (#1205-4) was 
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derived and cultured as previously documented 
at the University of California, Los Angeles 
(53). IPSC differentiation into mature astro- 
cytes or microglia, as well as further descrip- 
tion of the CRISPR excision experiments, is 
described here (20). 

For the CROP-seq (pooled CRISPRi) screen, 
two iPSC lines were maintained at the Univer- 
sity of California, San Francisco. For microglial 
differentiation, a transcription factor-based 
protocol (iTF-microglia) was used as described 
by Drager et al. (33). Similarly, neurons for 
the CROP-seq screen were generated using 
the transcription factor-based i?N approach 
as described by Tian et al. (32). For CROP-seq 
validation, we picked 42 frVars that were 
also annotated as lying within transcrip- 
tionally active regulatory regions defined 
as overlapping either brain enhancer and 
promoter marks from (23) or overlapping 
“active” brain ChromHMM marks from the 
NIH Roadmap Epigenomics Mapping Con- 
sortium (25), and analysis performed as de- 
scribed (20). 


Bioinformatic analysis 
Variant annotation 


FrVars were annotated for TFBS disruption 
and overlapped with functional brain anno- 
tations to create a high-confidence list of likely 
regulatory variants (data S2). We calculated 
TFBS disruption using both the motifbreakR 
package (37) and the SNP2TFBS webtool (30). 
We then identified frVar overlap with en- 
hancer and promoter annotations from sorted 
microglia, neurons, astrocytes, and oligoden- 
drocytes (23) as described (20). 


TFBS analysis 


Predicted TFBS enrichment within “active” 
MPRA elements was calculated using the 
HOMER (4.11) software suite (54) against a 
background set of all other oligos (20). Pre- 
dicted TFBS disruption was also scored for 
all variants using the SNP2TFBS webtool (30), 
and an enrichment OR for frVars was calcu- 
lated using Fisher’s exact test. We also par- 
titioned frVars into those derived from AD or 
PSP GWAS (20) and ran these sets through the 
SNP2TFBS algorithm, filtering for TFBSs with 
at least two predicted disruptions. Only TFBSs 
significantly (g < 0.05 or q < 0.1) enriched for 
disruption by frVars were plotted and con- 
sidered further. For these significantly dis- 
rupted TFs, we created PPI networks using 
the STRING (v11) webtool and standard pa- 
rameters (55), identifying target genes and cell 
type enrichment as described using functional 
genomic data (20). 


Additional analyses 


Pairwise Jaccard index calculations, open chro- 
matin enrichment analysis, bulk RNA-seq 
analysis, LD analysis within the 17q21.31 locus, 
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and other analyses are further described in the 
supplemental methods (20). 


Statistical reporting 


Statistical analysis was performed using the 
stats package in R. All hypothesis testing was 
two-sided, and all enrichment was determined 
using a Fisher’s exact test, except when ex- 
plicitly stated. For MPRA allelic skew multi- 
ple testing correction, unique variants from 
MPRA 1 and 2 were combined (5340 total) and 
Mann-Whitney U P values were FDR-adjusted 
(Benjamini-Hochberg method). FrVars were 
called at a threshold of g < 0.01. For CRISPR 
excision experiments, a two-tailed Student’s 
ttest was performed by comparing the average 
value of controls against the average value of 
enhancer-targeting conditions. For CRISPRi 
experiments, SNP:gene P values were com- 
puted and FDR-adjusted as described (20). 
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PARKINSON'S DISEASE 


GPNMB confers risk for Parkinson’s disease through 
interaction with a-synuclein 


Maria E. Diaz-Ortiz}, Yunji Seot, Marijan Posavi, Marc Carceles Cordon, Elisia Clark, Nimansha Jain, 
Rakshita Charan, Michael D. Gallagher, Travis L. Unger, Noor Amari, R. Tyler Skrinak, 

Roseanne Davila-Rivera, Eliza M. Brody, Noah Han, Rebecca Zack, Vivianna M. Van Deerlin, 

Thomas F. Tropea, Kelvin C. Luk, Edward B. Lee, Daniel Weintraub, Alice S. Chen-Plotkin* 


INTRODUCTION: An estimated 6 million people 
worldwide are affected by Parkinson’s disease 
(PD), for which there are no disease-modifying 
therapies. Since 2005, genome-wide associa- 
tion studies (GWASs) have aimed to find com- 
mon variant risk loci for PD. These PD GWASs 
have now uncovered 90 genome-wide sig- 
nificant risk signals. However, target genes 
and mechanisms remain largely unknown, 
hampering the field’s ability to develop 
downstream therapeutic approaches from 
these genomic studies. 


RATIONALE: We dissected a chromosome 7 locus 
linked to risk for PD by GWASs to find the target 
gene and understand its role in PD patho- 
physiology. We first performed colocalization 
analyses of expression quantitative trait locus 
(eQTL) and PD risk signals, confirming eQTL ef- 
fects with allele-specific expression (ASE) studies 
of candidate target genes in the human brain. 
We then characterized the consequences of ma- 
nipulating target gene expression in human 
induced pluripotent stem cell-derived neu- 
rons (iPSC-Ns) and other cell models. Finally, 


GPNMB 
protein 


Neuron A 


Effects on: 
¥ Synaptic expression 
¥ Uptake of aSyn PFF 
¥ aSyn pathology 


we determined whether levels of the target 
gene or protein were associated with disease 
state and severity in humans. 


RESULTS: We linked the GWAS-derived chromo- 
some 7 PD risk locus [sentinel single-nucleotide 
polymorphism (SNP) rs199347] to GPNMB— 
which encodes the transmembrane protein 
glycoprotein nonmetastatic melanoma pro- 
tein B (GPNMB)—with a 94% posterior prob- 
ability of the PD risk and caudate GPNMB 
eQTL signals sharing a causal variant. In hu- 
man brain samples across multiple brain re- 
gions (the caudate, cerebellum, and cingulate 
cortex) from both PD cases (n = 4) and 
neurologically normal controls (n = 2), ASE 
studies confirmed that the PD risk-associated 
haplotype was associated with threefold 
higher GPNMB expression. 

Having linked GPNMB expression levels to 
PD risk, we generated iPSC-N lines with nor- 
mal levels of GPNMB [wild-type (WT)], hetero- 
zygous loss of GPNMB (Het), or homozygous 
loss of GPNMB [knockout (KO); two differ- 
ent lines generated] by CRISPR-Cas9 genome 
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editing. Analyzing these iPSC-Ns using confocal 
microscopy, we found that loss of GPNMB 
was accompanied by a marked reduction in 
a-synuclein (aSyn) at the synapse—a result 
confirmed by immunoblotting of synaptosomal 
preparations for aSyn. Culturing the iPSC-Ns 
for an additional week, to a state of greater 
maturity, only exaggerated the differences in 
synaptic aSyn when comparing WT versus all 
other lines. Moreover, transcriptomic profiling 
of these genome-edited iPSC-N lines revealed 
that aSyn served as a key protein-protein inter- 
action hub for genes with dysregulated expres- 
sion in GPNMB Het and KO states. 

Because aSyn is the protein that accumu- 
lates in the hallmark pathological inclusions of 
PD, and because progression of PD may result 
from the uptake and cell-to-cell spread of path- 
ological forms of aSyn, we further explored 
the relationship between GPNMB and aSyn. In 
HEK293 and HeLa cells, we found that GPNMB 
and aSyn colocalized and coimmunoprecipi- 
tated. In iPSC-Ns, we observed minimal uptake 
of fibrillar forms of aSyn in GPNMB Het and 
KO iPSC-Ns, in contrast to rapid and obvious 
uptake of fibrillar aSyn by WT iPSC-Ns. Addi- 
tionally, when iPSC-Ns were cultured for 14 days 
after a one-time addition of fibrillar aSyn to the 
culture medium, WT iPSC-Ns developed hyper- 
phosphorylated, insoluble inclusions of aSyn, 
but these forms of aSyn pathology were minimal 
in GPNMB Het and KO iPSC-Ns. These exper- 
iments, together, demonstrated that GPNMB 
was necessary for internalization of fibrillar aSyn 
and subsequent development of aSyn pathology. 
Meanwhile, in cell lines with low GPNMB ex- 
pression that do not normally internalize fibril- 
lar aSyn, exogenous expression of GPNMB was 
sufficient to confer ability to take up aSyn. Fi- 
nally, using samples from 731 PD and 59 neu- 
rologically normal control individuals, we found 
that plasma GPNMB is elevated in PD and that 
PD individuals with higher plasma GPNMB 
levels have more severe disease. 


CONCLUSION: Computational, cell biological, 
and human tissue-based studies establish 
GPNMB as a GWAS-derived risk gene for PD, 
with higher expression levels mediating path- 
ogenicity through interactions with aSyn. As a 
transmembrane protein that is expressed at 
the cell surface, with an extracellular, soluble 
form generated by cleavage, GPNMB is a can- 
didate for biomarker development and ther- 
apeutic targeting in PD. 
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PARKINSON’S DISEASE 


GPNMB confers risk for Parkinson’s disease through 
interaction with a-synuclein 


Maria E. Diaz-Ortiz'?+, Yunji Seo'}, Marijan Posavi', Marc Carceles Cordon’, Elisia Clark’, 
Nimansha Jain'*, Rakshita Charan", Michael D. Gallagher"®, Travis L. Unger’, Noor Amari’, 
R. Tyler Skrinak’, Roseanne Davila-Rivera’, Eliza M. Brody’, Noah Han’, Rebecca Zack’, 
Vivianna M. Van Deerlin®, Thomas F. Tropea’, Kelvin C. Luk®, Edward B. Lee®, 


Daniel Weintraub”*®, Alice $. Chen-Plotkin'* 


Many risk loci for Parkinson’s disease (PD) have been identified by genome-wide association studies 
(GWASs), but target genes and mechanisms remain largely unknown. We linked the GWAS-derived 
chromosome 7 locus (sentinel single-nucleotide polymorphism rs199347) to GPNMB through 
colocalization analyses of expression quantitative trait locus and PD risk signals, confirmed by 
allele-specific expression studies in the human brain. In cells, glycoprotein nonmetastatic melanoma 
protein B (GPNMB) coimmunoprecipitated and colocalized with a-synuclein (aSyn). In induced 
pluripotent stem cell-derived neurons, loss of GPNMB resulted in loss of ability to internalize aSyn 
fibrils and develop aSyn pathology. In 731 PD and 59 control biosamples, GPNMB was elevated in PD 
plasma, associating with disease severity. Thus, GPNMB represents a PD risk gene with potential 

for biomarker development and therapeutic targeting. 


arkinson’s disease (PD) is a progressive 
neurodegenerative disease affecting an 
estimated 5 million people worldwide 
(2). Although genome-wide association 
studies (GWASs) have nominated >80 ge- 
netic loci as PD risk factors (2), target genes and 
mechanisms for these loci remain largely un- 
known. To date, drug development in PD has 
focused on pathways related to a-synuclein 
(aSyn)—the main component of disease-defining 
neuropathological lesions—or a handful of other 
genes causal for PD in a small minority of pa- 
tients (3, 4). However, these efforts have not yet 
yielded successful disease-modifying therapies. 
In this work, we sought to widen the net for 
PD therapeutic targets by dissecting a com- 
mon variant GWAS-derived PD risk locus. We 
used computational, cell biological, and hu- 
man tissue-based approaches to ascertain that 
GPNMB is a bona fide PD risk gene. Moreover, 
we implicated glycoprotein nonmetastatic 
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melanoma protein B (GPNMB)-aSyn inter- 
actions and GPNMB expression levels in the 
development of PD. 


GPNMB is the target gene at the rs199347 PD 
risk locus 


Multiple GWASs have linked a chromosome 
7 locus [sentinel single-nucleotide polymor- 
phism (SNP) rs199347] with PD (2, 5). Because 
1s199347 and linked variants are noncoding, 
we hypothesized that this locus confers PD risk 
through the modulation of expression levels 
of one or more target genes, as has been shown 
for a number of common variant GWAS-derived 
risk factors in other diseases (6-9). To nomi- 
nate candidate target genes, we used the Geno- 
type Tissue Expression (GTEx) database (J0), 
finding genes for which a shared causal var- 
iant could be linked to both PD risk and ex- 
pression levels in normal human tissue. Target 
genes within a 1-Mb window of the sentinel 
SNP were evaluated in colocalization analy- 
ses (11, 12): GPNMB, KLHL7, and NUPL2 (fig. 
S1). Among these, GPNMB demonstrated the 
greatest overlap between PD risk and expres- 
sion quantitative trait locus (eQTL) effects, 
with the most significant colocalization effects 
seen in the PD-affected caudate brain region 
(fig. SIB). Moreover, when we examined eQTL 
effects more broadly, we found that rs199347 
genotypes associated significantly with GPNMB 
expression levels in all GTEx-characterized 
brain regions as well as in whole blood (Fig. 1A). 
Thus, computational analyses of existing data- 
bases strongly implicated GPNMB as the tar- 
get gene at this locus, with higher levels of brain 
and blood expression linked to PD risk. 
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The mechanistic basis for eQTL effects lies 
in allele-specific expression (ASE), where- 
by different alleles at a given locus show 
different levels of expression (73). Thus, to 
validate the database-predicted eQTL rela- 
tionship between rs199347 and GPNMB expres- 
sion, we developed a capture RNA sequencing 
(RNA-seq)-based (74) ASE assay for human 
brain tissue (Fig. 1B) and used it to investigate 
whether individuals carrying two different 
alleles for the rs199347 haplotype demonstrated 
ASE for GPNMB. Moreover, to understand 
whether rs199347 haplotype effects on brain 
tissue expression extended to PD brains, we 
characterized multiple brain regions with 
varying degrees of pathological involvement in 
PD (in order of greatest to least aSyn patholog- 
ical involvement: the caudate, cingulate, and 
cerebellum) from both PD cases (n = 4) and 
neurologically normal controls (NCs) (7 = 2; 
extended demographic information is avail- 
able in table S1). 

We found a high degree of ASE for GPNMB 
regardless of PD status or brain region, with 
15 of 15 brain samples showing significant ASE 
(Fig. 1C). Moreover, all samples displayed the 
same direction of effect, with the PD risk haplo- 
type associating with higher expression levels, 
averaging approximately threefold higher than 
the PD protective haplotype. By contrast, ASE 
analyses for another gene at this locus, KLHL7, 
did not show preferential expression of either 
haplotype (Fig. 1C). Finally, we confirmed that 
GPNMB is expressed at the protein level in the 
human brain, examining multiple postmortem 
cases (table S1) by immunoblotting (Fig. 1D), 
which demonstrated a main band at the pre- 
dicted molecular weight of 72 kD across all 
cases, as well as a variably expressed higher- 
molecular weight band that collapses to 72 kD 
with deglycosylation (fig. S2), consistent with 
prior accounts of GPNMB’s variably glycosyl- 
ated forms (/5). To better understand what 
cell types might be expressing GPNMB, we 
also performed immunohistochemical stain- 
ing of human brain tissue (Fig. IE and table S1) 
and found GPNMB expression in multiple cell 
types, consistent with studies of spinal cord 
tissue (16). Thus, both database-predicted and 
empirically validated eQTL studies in human 
PD samples support GPNMB as the target gene 
at the rs199347 locus, with protein expression 
in the brain. 


Loss of GPNMB in iPSC-Ns decreases 

synaptic aSyn 

Having nominated GPNMB as the mechanis- 
tic link between a GWAS locus and PD risk, 
we sought to define its function in a disease- 
relevant context. GPNMB encodes the GPNMB 
protein, a transmembrane glycoprotein first 
identified in a screen of human melanoma 
cells with varying potential for metastasis (17). 
Although GPNMB has subsequently been 
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Fig. 1. Chromosome 7 PD risk locus is a multitissue GPNMB eQTL and 
shows ASE. (A) GTEx normalized expression size (NES) coefficients for 
rs199347's effect on GPNMB expression in various tissues (10). Red indicates 
whole blood, and yellow and orange indicate brain regions. Orange bars 
correspond to brain regions analyzed for ASE. The PD risk allele at rs199347 
(allele = A) is uniformly associated with higher GPNMB mRNA expression. 

(B) Workflow for ASE experiment in brain samples from PD patients (n = 4) and 
NCs (n = 2). (€) ASE analysis for rs199347 target gene candidates GPNMB 
and KLHL7 in caudate, cerebellum, and cingulate cortex brain samples of PD 
(red) and NC (blue) individuals heterozygous at this locus. Shaded dots indicate 


significant (Benjamini-Hochberg—adjusted P < 0.05) ASE, whereas empty dots do 
not show significant ASE. In the absence of ASE, the allelic ratio would be 0.5. 
For GPNMB, the allelic ratio approached 0.8, with higher expression for the 
GPNMB allele carrying the PD risk haplotype. n.s., not significant. (D) Immunoblots 
showing GPNMB expression in caudate brain lysates from NC (n = 5) and PD (n = 5) 
individuals. A 72-kD band was detected by the E1Y7J antibody in all cases, 

with variable expression of a higher—molecular weight glycosylated form. A 
glyceraldehyde-3-phosphate dehydrogenase (GAPDH) loading control is also 
shown. (E) Representative immunohistochemical staining of GPNMB in NC and 
PD brains demonstrates expression in multiple cell types. Scale bar, 50 ym. 


found to enhance malignant phenotypes ina 
number of cancers (/8), characterization of 
its function in the brain or in neurons is scant 
(15, 16, 19-21). We thus turned to a rapidly in- 
ducible human induced pluripotent stem cell- 
derived cortical neuron (iPSC-N) model (22) in 
which we could directly manipulate GPNMB 
levels to investigate its neuronal function. 
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Using CRISPR-Cas9 genome editing, we 
generated iPSC-N lines with normal levels of 
GPNMB [wild-type (WT)], heterozygous loss of 
GPNMB (Het), or homozygous loss of GPNMB 
[two lines: knockout 1 (KO1) and KO2; figs. 
S3 and S4] and assessed them by confocal 
microscopy. At a time point when morphology, 
expression of neuronal markers, and ability 
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to fire trains of action potentials reportedly 
indicate maturity [14 days after neuronal in- 
duction (22)], loss of GPNMB was accom- 
panied by a marked reduction in aSyn at the 
synapse (Fig. 2, A and B). Moreover, similar 
levels of decrease were observed in the Het, 
KOI, and KO2 lines when we quantified mi- 
croscopy results (Fig. 2C). By contrast, staining 
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Fig. 2. Heterozygous and homozygous loss of GPNMB results in altered 
aSyn expression and localization. (A) Immunofluorescence imaging of aSyn 
(SNCA, green, stained with MJFRI antibody), synapsin-1 (SYN1, red), and B-tubulin 
(TUBB3, cyan) in GPNMB WT, Het, KO1, and KO2 iPSC-Ns acquired 14 days 

after induction of neuronal differentiation. Scale bars, 20 um. (B) 2x magnified 
images of merged insets from (A) demonstrate greatly reduced localization of aSyn 
(SNCA, green) to synapses (labeled with SYN1, red) in GPNMB KO iPSC-Ns 
compared with WT. TUBB3 is converted to gray. (C) Quantification of aSyn (top) and 
synapsin-1 (bottom). N = 15 images (five images from each of three wells) per 
condition. Means + SEMs, as well as individual data points, are shown. Statistics are 
by one-way ANOVA followed by post hoc Dunnett's test comparing all other groups 
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of synapsin-1, another presynaptic terminal 
protein, did not differ significantly among the 
iPSC-N lines (Fig. 2C). 

To corroborate our immunofluorescence 
findings, we isolated synaptosomes from the 
WT, Het, KO1, and KO2 iPSC-N lines and 
quantified aSyn by immunoblot in both the 
synaptosomal and cytosolic fractions (Fig. 2, 
Dto M). At the same time point as our micros- 
copy experiments, synaptosomal aSyn was de- 
creased in the Het and KO1 lines (Fig. 2, D 
and F). By contrast, neither synaptosomal nor 
cytosolic synapsin-1 levels differed significantly 
among the WT, Het, KO1, and KO2 lines (Fig. 2, 
Gand H). 

Because aSyn expression at the synapse in- 
creases as neurons mature (23, 24), we inves- 
tigated whether the observed loss of aSyn could 
be the result of a delay in neuronal matura- 
tion that would improve over time. However, 
after an additional week of neuronal differ- 
entiation, differences in aSyn only increased, 
with the GPNMB Het, KOI, and KO2 lines 
showing ~’75% reduction in the synaptosomal 
aSyn fraction compared with the WT line 
(Fig. 2, land K). In addition, by 21 days after 
induction, both aSyn and synapsin-1 were 
found primarily in the synaptosomal fraction, 
and synapsin-1 levels were also significantly 
decreased in the GPNMB Het, KO1, and KO2 
lines compared with the WT line (Fig. 2M). 

Thus, decreased expression of GPNMB in 
neurons leads to loss of aSyn at the synapse, 
which suggests direct links between the rs19934:7 
risk haplotype, GPNMB expression, and aSyn. 
Moreover, loss of GPNMB may result in a broader 
synaptic defect. 


Transcriptomic profiling of iPSC-Ns reveals 
GPNMB effects on synaptic biology and aSyn 


To obtain an unbiased view of biological changes 
induced by loss of GPNMB in neurons, we tran- 
scriptionally profiled five independent repli- 
cates each of the GPNMB WT, Het, and KO2 
(hereafter referred to as just KO) iPSC-N lines. 
Although all three lines showed similar gene 
expression at baseline, upon neuronal differen- 
tiation, distinct profiles emerged for the WT 
versus the Het and KO cells (fig. S5). Specifi- 
cally, regardless of whether we performed hier- 
archical clustering using the top 35 differentially 
expressed genes (Fig. 3A and fig. S5) or principal 
components analysis (PCA) using all expressed 
genes (Fig. 3B and fig. S5), GPNMB WT lines 
separated from the Het and KO lines, whereas 
within-line replicates clustered together. As ex- 
pected, across all cell lines, as neuronal differen- 
tiation occurred, neuronal genes (e.g., MAP2, 
TUBB3, and SYN1) were up-regulated, where- 
as pluripotency genes (e.g., SOX2) were down- 
regulated (fig. S6). 

In the neuronally differentiated lines, over- 
lap among genes dysregulated in the GPNMB 
Het and GPNMB KO lines compared with the 
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WT line was considerable, with 142 and 449 
overlapping up-regulated and down-regulated 
genes, respectively (out of 177 and 610 up- and 
down-regulated genes, respectively, in the Het 
condition, and 176 and 525 up- and down- 
regulated genes, respectively, in the KO condi- 
tion). By contrast, relatively few genes showed 
differential expression comparing the GPNMB 
Het versus KO lines (Fig. 3, C to E). 

Globally, two modules of coexpressed genes 
distinguished WT from GPNMB Het and KO 
iPSC-Ns, with one module (M1) increased in 
expression in WT compared with GPNMB Het 
and KO iPSC-Ns and one module (M2) de- 
creased in expression in WT iPSC-Ns (fig. S5). 
Biological pathway analysis of M1 genes re- 
vealed a highly significant overrepresentation 
of synapse-related categories (Fig. 3F and table 
$2), corroborating our cell biological findings. 
The top three categories overrepresented in 
M2 genes related to receptor tyrosine kinases 
(Fig. 3G and table S83). 

Using module coexpression network analysis 
with the GeneMANIA (25) database of protein- 
protein interactions (Fig. 3, H and I), we found 
that aSyn (encoded by SNCA) served as a key 
protein-protein interaction hub for M1 genes 
with dysregulated expression in GPNMB Het 
and KO states. These data—from unbiased 
bottom-up analyses of global gene expression— 
highlight the strong functional effects of GPNMB 
loss on aSyn. 


GPNMB is necessary and sufficient for cellular 
internalization of aSyn 


Because data from the edited iPSC-N lines 
supported a functional interaction between 
GPNMB and aSyn from both targeted and 
unbiased lines of investigation, we investi- 
gated whether these two proteins might di- 
rectly interact. In iPSC-Ns, GPNMB expression 
is not high, which makes visualization and co- 
immunoprecipitation (co-IP) experiments chal- 
lenging, so we began with immortalized cell 
lines. Confocal microscopy of GPNMB and 
aSyn revealed colocalization between GPNMB 
and aSyn in LAMP1+ cytoplasmic punctae when 
transfected in both HeLa (Fig. 4A) and HEK293 
(fig. S2) cells. Moreover, when expressed in both 
cell types, GPNMB and aSyn coimmunoprecipi- 
tated, regardless of whether the immunoprecip- 
itated protein was GPNMB or aSyn (Fig. 4, B 
and C, and fig. $2). 

Encouraged by these data that suggest that 
a GPNMB-aSyn protein-protein interaction 
might form the basis for the profound effects 
on aSyn caused by loss of GPNMB, we next 
investigated the functional interaction of these 
two proteins in a cellular model of PD. Specif- 
ically, when aSyn is fibrillized and introduced 
in the culture medium to neurons, including 
iPSC-Ns, these fibrillar forms of aSyn are known 
to be internalized by the cells, which will sub- 
sequently develop aSyn inclusions with patho- 
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logical hallmarks, including phosphorylation 
at serine 129 (S129) and decreased solubility 
(26). We thus used this PD model to probe the 
effects of neuronal loss of GPNMB by intro- 
ducing fibrillar aSyn to cultures of our GPNMB 
WT, Het, and KO iPSC-Ns. 

Although WT iPSC-Ns at a mature stage of 
differentiation readily internalized labeled pre- 
formed fibrils (PFFs) of aSyn (Fig. 4D, second 
row), we observed a significant reduction in 
aSyn internalization in the Het and KO iPSC-N 
lines, to levels barely above background (Fig. 4, 
D and F). Thus, GPNMB is necessary for the 
robust internalization of aSyn into iPSC-Ns. 

To test whether GPNMB is also sufficient for 
aSyn internalization, we turned to a cell line that 
does not internalize aSyn at baseline. In HEK293 
cells, GPNMB is expressed at very low levels 
endogenously and, in the absence of exogenous 
GPNMB expression, aSyn was not taken up by 
cells (Fig. 4E, top row). However, expression of 
GPNMB, but not of a control transmembrane 
protein (TMEMI106B), conferred the ability to 
internalize aSyn (Fig. 4, E and G). Thus, GPNMB 
is sufficient for the internalization of aSyn. 


GPNMB expression is permissive for the 
development of aSyn pathology in neurons 


Having demonstrated that GPNMB is needed 
for robust internalization of fibrillar aSyn in 
iPSC-Ns, we next investigated whether loss of 
GPNMB would rescue the development of 
aSyn pathology in these cells. To test this, we 
introduced aSyn PFFs into the culture medium 
of GPNMB WT, Het, and KO iPSC-N lines at 
the same time point as the internalization ex- 
periments (day 14 after neuronal induction) 
and then continued to culture the neurons for 
14 more days after the one-time addition of 
fibrillar aSyn. At the end of this period, in WT 
neurons, as expected, we observed abundant 
examples of S129-phosphorylated aSyn (Fig. 5A, 
second row), often along neuronal processes 
(Fig. 5C). In GPNMB Het or KO neurons, how- 
ever, minimal phospho-aSyn staining was found 
(Fig. 5A). By extracting soluble proteins from our 
neuronal cultures, we found that WIT GPNMB 
iPSC-Ns developed insoluble aSyn aggregates 
to amuch greater degree than GPNMB Het or 
KO neurons (Fig. 5B, second row). These insol- 
uble aggregates had a dense appearance and 
were often found in a perinuclear location 
(Fig. 5E). Moreover, quantifications of S129- 
phosphorylated aSyn, with (Fig. 5F) or without 
(Fig. 5D) first extracting soluble proteins, revealed 
a >70% reduction in hyperphosphorylated aSyn 
species in the GPNMB Het and KO neurons. 
Thus, GPNMB functions in the development 
of PD pathology through mechanisms related 
to the cellular internalization of fibrillar aSyn. 


GPNMB measures reflect clinical status in PD 


Findings from our human eQTL and ASE analy- 
ses and iPSC-N experiments support a model 
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Fig. 3. Transcriptomic profiling of iPSC-Ns with heterozygous or homozygous 
loss of GPNMB reveals altered synaptic biology. (A) Heatmap showing 
clustering of iPSC-Ns (day 14 after neuronal induction) based on the top 35 genes. 
N = 5 replicate samples from three independent neuronal differentiations per 
GPNMB genotype. (B) PCA using all expressed genes. (€ to E) Volcano plots for 
pairwise comparison of GPNMB KO cells compared with WT (C), GPNMB Het cells 
compared with WT (D), and GPNMB KO cells compared with Het (E). The horizontal 
axes indicate the logs fold-change (logsFC) in gene expression. The vertical axes 
indicate the —logig of Benjamini-Hochberg—adjusted P value (Paqj), with dotted lines 
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indicating a Pag = 0.01 significance threshold. (F and G) Two coexpression 
modules (M1 and M2) differentiated WT iPSC-Ns from GPNMB Het and KO iPSC- 
Ns. The top 10 enriched pathways for M1 (F)—decreased in GPNMB Het and 
KO iPSC-Ns—and M2 (G)—increased in GPNMB Het and KO iPSC-Ns—are listed. 
(H and I) Network analysis of genes in modules M1 (H) and M2 (I) constructed 
using the GeneMANIA (25) protein-protein interaction dataset. Each point 
represents a gene, with edges showing protein-protein interactions or coexpression. 
The most connected genes, or hubs, are labeled, with teal representing coexpression 
hubs and maroon representing protein-protein interaction hubs. 


5 of 13 


RESEARCH 


RESEARCH ARTICLE 


(neuronal struc 


A°C (row 3) or a 


Diaz-Ortiz et al., Science 377, eabk0637 (2022) 


control) or warmed to 37°C for 90 m 
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-1% Triton X-100 


pS129 Intensity 


Fig. 5. GPNMB is necessary for development of aSyn pathology in iPSC-Ns. 
(A and B) Untagged human aSyn PFFs were added to iPSC-Ns at DIV14. Neuronal 
cultures were maintained for 14 additional days to allow for development of 

aSyn pathology before cells were washed, fixed, and stained under conditions 
without 1% Triton X-100 (TX) extraction (A) or after extraction of soluble proteins 
with 1% TX (B). 81a antibody against aSyn phospho-S129 (pS129) was used 

to stain pathology (green). Neurons were also stained for nuclear DNA (blue) and 
tubulin (gray), although this structural protein was largely removed under 1% 

TX extraction conditions. Scale bar, 50 wm. For each set, a negative control 
condition without addition of aSyn PFFs is shown in the top row, followed by addition 


in which higher GPNMB expression levels con- 
fer risk for PD through interactions with aSyn 
that are permissive for neuronal internaliza- 
tion of fibrillar aSyn. Because the extracellular 
domain of GPNMB is known in cancer con- 
texts to be shed and to circulate in bodily fluids, 
we explored whether GPNMB might be de- 
tected in plasma and cerebrospinal fluid (CSF) 
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HET PFF 


the 


after extraction of soluble 


samples from 731 PD and 59 NC individuals 
(Fig. 6; also see demographic data in tables S4 
to S7). Furthermore, we sought to understand 
whether GPNMB measures in biofluids from 
living subjects might reflect either rs199347 
genotype or disease. 

Echoing our results from eQTL and ASE 
analyses in human brain samples (Fig. 1), we 
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of aSyn PFFs in WT (row 2), GPNMB Het (row 3), and GP 
Only the WT iPSC-Ns demonstrate abundant pS129 staining. (€ and E) The pS129 
staining shows a neuritic pattern [(C), inset from (A)] in the absence of 1% TX 
extraction and a dense perinuclear aggregate pattern [(E), inset from (B)] 

after extraction of soluble proteins, only in the WT iPSC-Ns. (D and F) Quantification 
of pS129 pathology scores is shown under conditions without 1% TX (D) and 
proteins with 1% TX (F). N = 30 images from three 
independent differentiations (shown as blue, pink, and black dots) per line. Boxplots 
depict median, 25th, and 75th quartiles, and whiskers depict full ranges. Groups 


were compared by Mann-Whitney test. ****P < 0.0001; *P > 0.05. 


TX4insoluble p$129 Intensity 


MB KO (row 4) iPSC-Ns. 


found that rs199347 genotypes associated with 
GPNMB levels in CSF in the direction pre- 
dicted by GTEx (J0), whereas no protein quan- 
titative trait locus (pQTL) effect was observed 
in the plasma (Fig. 6, A and B). Notably, how- 
ever, plasma GPNMB levels were elevated in 
PD individuals compared with NCs (Fig. 6C), 
and this elevation persisted after adjustment 
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Fig. 6. Biofluid GPNMB protein levels associate with rs199347 genotype, 
PD diagnosis, and disease severity. (A and B) Plasma (A) and CSF (B) 
GPNMB levels grouped by rs199347 genotype. Genotypes were compared with 
Kruskal-Wallis test, followed by post hoc Dunn's test. **P < 0.01. (€ and 

D) Plasma (C) and CSF (D) GPNMB levels grouped by diagnosis. PD (red) 
versus NCs (blue) were compared by Mann-Whitney test. Boxplots depict 


for age, sex, and rs199347 genotype (Bppvsnc = 
0.141, P = 0.0397; tables S8 and S9). Moreover, 
in a subset of 96 PD individuals with clinical 
severity measures (Fig. 6E), plasma GPNMB 
levels were associated with disease severity 
as measured by the Unified Parkinson’s Dis- 
ease Rating Scale Part III (UPDRS-PIID score 
(27), with higher plasma GPNMB levels found 
in more severely affected PD patients (Fig. 6F 
and table S10). 

We confirmed these results in a second 
cohort of PD individuals from whom we had 
previously obtained GPNMB plasma mea- 
sures using an aptamer-based method (28). 
Despite differences in measurement plat- 
form, disease duration, and severity between 
the original cohort of 96 individuals and this 
cohort of 95 additional, nonoverlapping PD 
individuals, GPNMB levels replicated their 
association with disease severity (Fig. 6G and 
table S11). 
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Discussion 

Using statistical genetic methods, direct mea- 
sures of ASE in human brain samples, CRISPR- 
based manipulation of gene levels in iPSC-Ns, 
and human biomarker studies, we establish 
GPNMB as a PD risk gene. Moreover, using a 
PD cellular model, we nominate GPNMB as a 
potential therapeutic target for interrupting the 
transmission of pathological aSyn. Finally, we 
demonstrate the importance of GPNMB expres- 
sion levels, with even partial loss of expression 
exerting profound effects on aSyn and the syn- 
apse and increased expression associating with 
clinical severity in PD. Our results support a 
model in which PD risk-associated haplotypes 
at the rs199347 locus result in increased ex- 
pression of GPNMB at the RNA and protein 
levels. Increase in GPNMB expression, in turn, 
may confer risk for development of PD through 
effects on the synapse and specifically on the 
cellular uptake of pathological forms of aSyn. 
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median, 25th, and 75th quartiles, and whiskers depict full ranges. **P < 0.01. 
(E) Demographics for discovery and replication cohorts used for UPDRS-PIII 
analysis. (F and G) Scatterplots showing positive correlation between 
UPDRS-PIIl scores and plasma GPNMB values in discovery (F) and replication 
(G) cohorts. Spearman's rho and P value are displayed. RFU, relative 


In this work, we integrated computational 
analyses of large databases with direct exper- 
imental verification of key predictions from 
these analyses. For example, statistical colocal- 
ization techniques and eQTL analyses suggested 
that the rs199347 haplotype affects GPNMB ex- 
pression levels, and direct measures of GPNMB 
ASE in human brain samples confirmed these 
effects. Similarly, transcriptomic profiling of 
GPNMB-edited iPSC-Ns revealed broad effects 
on the synapse, with aSyn emerging as a hub 
protein from network analyses of genes affected 
by loss of GPNMB, and our cell biological studies 
confirmed a functional GPNMB-aSyn_ interac- 
tion with repercussions for the cellular trans- 
mission of pathological aSyn. 

Additionally, we integrated the use of patient- 
derived samples with bench-based manipula- 
tion in a model system. The failure, to date, of 
many cell- and mouse model-based findings to 
translate to clinical utility in PD underlines the 
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importance of verifying that biological path- 
ways implicated in model systems show cor- 
responding signals in human disease. We used 
~800 human samples to ground our study in 
clinical relevance. Direct manipulation of 
GPNMB in iPSC-Ns then allowed us to estab- 
lish a causal effect for GPNMB in cellular pro- 
cesses that may mediate PD pathology. 

What remains to be seen is whether GPNMB 
interacts with both normal and pathological 
conformations of aSyn to a similar extent and 
whether our neuronal findings will translate 
in vivo. A detailed analysis of the interaction 
between these two proteins, including whether 
the interaction with aSyn occurs with the sol- 
uble form of GPNMB versus the transmem- 
brane protein before cleavage, would be key to 
downstream attempts at targeting. However, 
the discovery of a protein, expressed at the cell 
surface, which is both necessary and sufficient 
for internalization of fibrillar aSyn in neurons, 
opens up therapeutic avenues in PD. 


Materials and methods 


A more detailed version of the materials and 
methods is provided in the supplementary 
materials. 


RNA isolation and library preparation from 
human brain samples 


Human postmortem brain samples from NCs 
(n = 2) and PD (n = 4) individuals were ob- 
tained from the University of Pennsylvania Cen- 
ter for Neurodegenerative Disease Research 
(CNDR) Brain Bank and dissected as previ- 
ously described (29). Demographics are sum- 
marized in table SI. The regions analyzed 
included the caudate nucleus, cingulate gyrus, 
and cerebellum. Samples were made up of 
predominantly gray matter. Genotypes for the 
brain samples were obtained as previously 
described (29, 30). 

Library preparation was performed with the 
KAPA RNA HyperPrep Kit (KR1350, lumina 
Platforms, Roche, WI). Briefly, RNA was frag- 
mented using heat and magnesium for 8 min 
at 94°C to obtain 100- to 200-base pair (bp) 
fragments. The first cDNA strand was then 
synthesized using a thermocycler, followed by 
the second strand synthesis combined with 
A-tailing. Unique adapters were then ligated 
onto the library insert fragments and ampli- 
fied using high-fidelity, low-bias polymerase 
chain reaction (PCR). The products then under- 
went a bead-based cleanup for purification 
of the cDNA libraries. 


SeqCap RNA ChoiceTM probe pool design 


The SeqCap RNA probe design pool was de- 
signed by Roche Sequencing Solutions Custom 
Design (Roche, WI). The pool was designed to 
contain probes ~60 bp in length with no more 
than 20 close matches in the genome, as de- 
termined by the Sequence Search and Align- 
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ment by Hashing Algorithm (SSAHA), for the 
purposes of providing sufficient coverage of 
transcripts of interest while minimizing poten- 
tial off-target effects. A close match was defined 
as any genomic sequence that differed from 
one of the probe sequences by five or fewer 
single-base insertions, deletions, or substitu- 
tions. The majority of included probes had no 
off-target matches, with an exceedingly small 
percentage of probes displaying very few off- 
target matches. Only exonic probes covering 
the transcripts of interest were included and, 
to minimize SNP-mediated capture bias, probes 
that overlapped any SNPs in linkage dis- 
equilibrium (7” > 0.2, where 7° is the coeffi- 
cient of determination) with the sentinel PD 
GWAS SNPs were excluded. 


Target cDNA enrichment and sequencing 


Target cDNA enrichment and sequencing was 
performed as previously described (/4) using 
the SeqCap EZ Accessory kit v2 (07145594001, 
Roche, WI). Briefly, equimolar amounts of 
cDNA libraries were combined for a total mass 
of 1 ug. Each of these libraries underwent 
multiplexed PCR with unique index oligonu- 
cleotides. The libraries were then hybridized 
with capture probes using the SeqCap EZ Hy- 
bridization and Wash Kit (05634261001, Roche, 
WI). Libraries were dried with heat in a vacuum 
and resuspended with hybridization reagents. 
Pooled capture probes for each region were 
added to the resuspended libraries and incu- 
bated for 20 hours in a thermocycler at 47°C, 
with the lid temperature at 57°C. The captured 
multiplex cDNA samples then underwent strin- 
gent washing steps and were amplified using 
ligation-mediated PCR (LM-PCR). These post- 
capture PCR-amplified libraries were pooled 
and sequenced on two lanes of an Illumina 
HiSeq 2500 with 150-bp paired-end reads, 
yielding ~150 million read pairs per lane. 


Read mapping and ASE analyses 


To assess RNA-seq reads quality, we used FastQC 
(31), and for reads quality filtering and trim- 
ming we used Trimmomatic (version 032) (32). 
We ran Trimmomatic to remove low-quality 
fragments in a 4-base-wide sliding window 
(average window quality below PHRED 20), 
and low-quality leading and trailing bases (be- 
low PHRED 10). We also dropped all the reads 
with average PHRED quality below 25, as well 
as reads shorter than 75 bases. Depending on 
the sample, 65 to 80% of reads passed this 
trimming and filtering step, resulting in 6.5 to 
24 million read pairs per sample for mapping. 

To perform unbiased allele-specific read 
mapping to the reference human genome (hg19) 
we applied WASP-STAR pipeline (fig. S7). First, 
we mapped reads with STAR (33), applying 
2-step alignment and filtering them for map- 
ping bias using WASP (34). Before proceed- 
ing with variant calling, we removed duplicate 
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reads using rmdup_pe.py script incorporated 
into WASP pipeline. To call and filter single- 
nucleotide variants (SNVs), we used GATK 
tools: HaplotypeCaller, SelectVariants, and 
VariantFiltration. We obtained allele-specific 
read counts by GATK - ASEReadCounter. To 
filter out intergenic variants, we functionally 
annotated SNVs using VariantAnnotation 
(35) and TxDb.Hsapiens.UCSC.hg19.known- 
Gene (36) R packages. To test for ASE at the 
gene level, we first selected proxy SNPs that 
were highly linked (7? > 0.6) with rs199347 and 
located within a coding region for the gene of 
interest to assign allele of origin. For GPNMB, 
we assigned the allele of origin for each tran- 
script read based on genotype at rs199355, and 
for KLHL7, we assigned the allele of origin based 
on genotype at rs2072368. We then tested for 
allelic imbalance with a beta-binomial mod- 
el with overdispersion using the MBASED R 
package (13). P values were adjusted for false 
discovery rate using the Benjamini-Hochberg 
method (37). 


Colocalization analysis 


Colocalization analysis was performed as pre- 
viously described (17) using the coloc package 
in R (version 4.0-2). The prior probabilities of 
pl, p2, and p12 were set to 1 x 10~*, 1 x 10%, 
and 1 x 10°° respectively. Significant colocali- 
zation was determined as having a PPH4 > 
75%. Sensitivity analysis was performed to de- 
termine whether PPH4 is robust over plausible 
values of p12 (1 x 10~° to 1 x 107°) (38). As- 
sociation plots were generated using Locus- 
Comparer package (version 1.0.0) (39) in R. 


Immortalized cell line transfection and collection 


For co-IP experiments, HEK293 or HeLa cells 
were plated at 700,000 or 1 million cells per 
well, respectively, and 18 hours after plating, 
transfection was performed with 4 ug of each 
DNA construct and Lipofectamine 2000 (Thermo 
Scientific) in serum-free Dulbecco’s minimum 
essential medium (DMEM). Cells were trans- 
fected with either GPNMB-myc-DDK pCMV6- 
Entry (Origene) and 5’ untranslated region (5' 
UTR)-Syn pcDNA3.1+ (obtained from K. Luk) 
for the GPNMB-flag IP or GPNMB untagged 
clone pCMV6-XL4 (Origene) and Syn-Myc 
pcDNA3.1+ (obtained from K. Luk) for the Syn- 
Myc IP. Four hours after transfection, media 
was changed to DMEM with 10% fetal bovine 
serum (FBS), 1% L-Glut, and 1% Pen-Strep. At 
20 hours posttransfection, cells were washed 
in Dulbecco’s phosphate-buffered saline (dPBS) 
and lysed in CHAPS buffer (25 mM Tris, 150 mM 
NaCl, 1 mM EDTA, 1% CHAPS, 5% glycerol, pH 
to 7.4). Lysates were collected and spun down at 
4°C for 30 min at 21380 xg. Bicinchoninic acid 
(BCA) assays (Thermo Scientific) were used to 
determine protein concentrations. 

For immunofluorescence experiments, HEK293 
or HeLa cells were plated at either 100,000 or 
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200,000 cells per well on either PDL-coated or 
untreated 12-mm glass coverslips in a 12-well 
format. 18 hours after plating, the cells were 
transfected with 1.6 ug of each DNA construct 
(GPNMB untagged clone pCMV6-XL4 and 5’ 
UTR-Syn pcDNA3.1+) and Lipofectamine 2000 
(Thermo Scientific) in serum-free DMEM. 


Co-IP 


To immunoprecipitate GPNMB, 300 uL of anti- 
Flag-conjugated beads (Sigma A2220) or mouse 
immunoglobulin G (IgG)-conjugated beads 
(Sigma A0919, used as a negative control) were 
used to immunoprecipitate from 1000 ug of 
lysate from GPNMB-myc-DKK and 5'UTR-syn 
double-transfected cells in CHAPS buffer over- 
night at 4°C. To immunoprecipitate aSyn, 300 pL 
of anti-c-Myc-conjugated beads (Sigma A7470) 
or rabbit IgG-conjugated beads (Sigma A2909, 
used as a negative control) were used to im- 
munoprecipitate from 1000 pg of lysate from 
GPNMB untagged clone and Syn-Myc double- 
transfected cells in CHAPS buffer overnight at 
4°C. After 24 hours, the protein-conjugated 
beads were washed three times with CHAPS 
buffer and the bound protein was competitively 
eluted from the beads using either 250 uM 3x 
flag peptide (Sigma) or 250 mM myc peptide 
(Sigma) for 1 hour at 4°C. 


CRISPR-Cas9 KO of human iPSCs 


Human iPSCs with Neurogi and Neurog2 in a 
bicistronic doxycycline-inducible expression cas- 
sette (22) were used for generation of GPNMB 
KO iPSCs and rapid induction of iPSC-Ns. The 
protocol for generating CRISPR-edited iPSCs 
is summarized in fig. S3 and detailed in the 
supplementary materials and methods. 


Culture and differentiation of iPSC-Ns 


Before differentiation, iPSC cultures were main- 
tained on Matrigel-coated 6-cm tissue culture 
plates with mTeSR1 media and mechanically 
passaged every 4: days with StemMACS passag- 
ing solution XF. The neural induction protocol 
is summarized in fig. S4, based on prior studies 
(40), and detailed in the supplementary ma- 
terials and methods. 


Immunofluorescence 


The media was aspirated from coverslip- 
containing wells, and the cells were fixed in 
2% paraformaldehyde in dPBS for 15 min at 
room temperature. After fixing, the coverslips 
were rinsed five times with dPBS and blocked/ 
permeabilized with blocking buffer (3% bovine 
serum albumin + 0.05% saponin in dPBS) for 
1 hour before incubating in primary antibody 
overnight. Image stacks of 1-um thickness 
were acquired by confocal microscopy (Leica 
SP5) using a 40x oil immersion objective with 
2x zoom. Antibody concentrations are sum- 
marized in the key resources tables in the 
supplementary materials. 
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Image processing and quantification 

Image processing to quantify synaptic pro- 
teins was performed using a Cell Profiler 
pipeline based on Danielson et ai.’s previously 
published work (47). A single slice was chosen 
from each stack to focus our analysis on the 
plane with the most abundant synapsin-1 
staining. TUBB3 images underwent image 
enhancement for neurite-like features and 
were used to calculate TUBB3+ area for nor- 
malization and expanded to generate a mask. 
Synapsin-1 and aSyn images underwent en- 
hancement for speckle-like features, followed 
by object identification and characterization of 
object size/intensity. The total integrated inten- 
sity (i.e., sum for all particles) was used for anal- 
ysis. The pipeline used has been made publicly 
available in the supplementary materials. 

aSyn pathology in iPSC-Ns was also quan- 
tified using CellProfiler. Maximum intensity 
projections of z-stack images were created using 
FIJI. Desired 81a+ objects were identified with 
intensity thresholding and the total area of 81a+ 
objects were normalized with total area of 
DRAQ5+ objects. To quantify MJFR1+ staining 
in HEK293 overexpressing GPNMB, nuclei artifact 
staining was removed by generating DRAQ5+ 
nuclei mask and excluding MJFR1+ speckles 
within the mask. Total area of MJFR1+ objects 
was normalized by total area of DRAQ5+ 
objects. 

A researcher blinded to sample identity was 
provided with randomly ordered maximum 
intensity projection of z-stack images of iPSC-Ns 
treated with AF594-PFF. Scores ranging from 
0 to 3 were given for each criterion: intensity, 
length, and frequency of internalized PFFs. The 
three scores were added and were normalized 
to the average score of WT iPSC-Ns. 


Synaptosome extraction 


Synaptosomes were extracted using Syn-PER 
Synaptic Protein Extraction Reagent (Thermo- 
Fisher) per the user instructions. 


Immunoblotting 


Samples were diluted in 5X concentrated sam- 
ple buffer [10 g Sucrose, 1.85 mL 0.5 M Tris, 
pH 6.8, 1.0 mL 0.1 M EDTA, 1.0 mL of 0.1% 
Bromophenol Blue, 1.0 mL of 0.05% Pyronine 
Yellow, 0.615 g of dithiothreitol (DTT), 10 mL 
of 10% sodium dodecyl sulfate (SDS), and 
adjusted to a final volume of 20 mL with Milli- 
Q Water] and boiled at either 100°C for 10 min 
or 70°C for 15 min for heat-sensitive proteins 
(GPNMB). Samples were run on 4 to 20% poly- 
acrylamide TGX gels (Bio-Rad Laboratories) and 
transferred onto 0.2 nitrocellulose membrane 
(Bio-Rad Laboratories). When blotting for aSyn, 
membranes were fixed in 0.4% PFA for 30 min 
(42). Membranes were blocked in 5% milk in 
tris-buffered saline (TBS) for 1 hour and blotted 
overnight at 4°C with specific antibodies. The 
membranes were incubated for 2 hours in 
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horseradish peroxidase (HRP)-conjugated sec- 
ondary antibodies and developed using West- 
ern Bright ECL and Sirius HRP substrates 
(Advansta). Antibody concentrations are sum- 
marized in the key resources tables. Densitom- 
etry was performed using the Bio-Rad Image 
Lab Software. 


RNA-seq of iPSC-Ns 


RNA was extracted from iPSC-Ns on days 0 and 
14 after doxycycline induction using QIAGEN 
RNAeasy mini kit. RNA integrity was measured 
using RNA nano chips (Agilent) on an Agilent 
2100 Bioanalyzer. All samples had RIN > 9.4. 
Library preparation was performed with the 
TruSeq Stranded mRNA Library Prep kit 
(Illumina cat no. 20020595) using 225 ng of 
total RNA and following manufacturer instruc- 
tions. High-sensitivity DNA chips (Agilent) 
were used to balance libraries before sequenc- 
ing. In total, 30 RNA samples [three genotypes 
(WT, Het, and KO) x two differentiation times 
(days 0 and 14) x five biological replicates] were 
sequenced. All 30 samples were sequenced 
together on a single S2 flow-cell generating 
>1.1 x 10° of 100-bp single-end reads in total 
(on average 3.69 x 10’ reads per sample, rang- 
ing from 2.9 x 10’ to 5.3 x 10”). Asummary of the 
Illumina NovaSeq run is provided in table S12. 


Read quality control and filtering 


A quality check of the raw reads was assessed 
using FastQC (37) and summarized with MultiQC 
(43). Next, the adapters were removed and 
filtering of the low-quality reads was con- 
ducted using Trimmomatic, version 0.39 (32), 
with the following parameters: ILLUMINACLIP: 
TruSeq3-SEfa:2:30:10 LEADING:5 TRAILING:5 
SLIDINGWINDOW:5:20 MINLEN:50 AVGQUAL:30 
HEADCROP:10. The adapters that matched 
the sequences provided by TruSeq3-SE.fa were 
removed. The low-quality bases were cut off the 
start and off the end of reads if their PHRED 
score was below 5. The sliding window of 5 bp 
was applied to trim the bases if the window 
PHRED score dropped below 20. The 10 bases 
from the start of the reads were also cropped. 
Finally, the reads were discarded if the average 
PHRED score of the read was below 30. 


Read alignment and quantification 


The sequence alignment BAM files generated 
by Bowtie 2 were used as the input to RSEM 
software tool, to quantify gene expression levels 
(44). On average, 3.61 x 10’ (97.8%) of reads per 
sample passed the filtering steps and were 
mapped versus reference genome GRCh38 
using Bowtie 2 with the following parameters: 
bowtie2 -q—phred33—sensitive—dpad O-gbar 
99999999-—mp 1,1—np I—-score-min L,O,-0.1— 
nofw -p 32 -k 200 -x, as suggested by RSEM 
manual. The read alignment rate was on av- 
erage >89% (table S12). To obtain the raw 
read counts at the gene level, an expectation 
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maximization algorithm (RNA-seq by Expec- 
tation Maximization) was run by the following 
RSEM command: rsem-calculate-expression— 
bowtite2—forwardprob 0. 


Differential gene expression (DGE) analysis 


To detect significant differences in iPSC-N gene 
expression data between day O and day 14 and 
between the genotypes (WT, Het, and KO), 
DESeq function form DESeq2 Bioconductor’s 
R package (45) was used. RNA-seq data were 
modeled using the negative binomial distribu- 
tion that accounts for overdispersion. To con- 
duct the pairwise comparisons (day 14 versus 
day O in WT, KO versus WT, Het versus WT, 
and KO versus Het at day 14) for each gene, the 
nbinomWaldTest statistic was used. The Wald 
test P values were calculated by scaling the 
coefficients by their standard errors and then 
compared with a standard Normal distribu- 
tion. After DGE analysis, the lfeShrink function 
from R package apeglm (46) was applied to 
shrink the log2FC. We corrected for false discov- 
ery rate (FDR) using the Benjamini-Hochberg 
method (37). Genes with a Benjamini-Hochberg- 
corrected P < 0.01 and a |log2 fold-change| > 
1 were considered to be significantly differ- 
entially expressed. To visualize the results of 
DGE analyses we used EnhancedVolcano R 
package version 1.10.0 (47). 


Modular coexpression network analysis 


Modular coexpression network analysis of iPSC- 
Ns was performed using R package CEM?Tool 
(CoExpression Molecules identification Tool), 
version 1.14.1 (48). The rlog normalized expres- 
sion data (15981 genes in 15 samples) were 
used as an input to cemitool function. As a first 
step, cemitool conducted an unsupervised fil- 
tering of expression data using the inverse 
gamma distribution to model the variance of 
genes. Out of 15981 genes, 1100 genes with P < 
0.1 survived this filtering and were used for 
downstream analyses. The genes were then 
separated into modules using the dissimilar- 
ities measures and the Dynamic Tree Cut pack- 
age (49). The minimum number of genes per 
module was set to 20, and similar modules 
were merged together based on correlation. 

To determine biological functions associ- 
ated with the coexpression modules, the gene 
ontology (GO) gene sets c5.all.v7.4, which con- 
tain GO resources (BP, biological process; CC, 
cellular component; MF, molecular function) 
and human phenotype ontology (HPO) from 
MSigDB database (50), were used in overrepre- 
sentation analysis (ORA). The module gene 
set enrichment analysis (GSEA) was per- 
formed using the fgsea R package (57) within 
the CEMitool pipeline. First, a z-score gene 
normalization on all input genes was per- 
formed, followed by calculation of the mean 
for each sample class (GPNMB WT, Het, and 
KO). Next, a preranked GSEA was performed 
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independently for each GPNMB genotype. 
The module activity was visualized by nor- 
malized enrichment score. 

To construct the gene interactions network 
in the coexpression modules, the combined hu- 
man protein-protein interaction data, down- 
loaded from GeneMANIA, were added to the 
CEMiTool object using the function interactions_ 
data. The interactions between the genes were 
visualized using the plot_interactions function. 


aSyn PFF internalization and 
immunofluorescence experiments 


Human WT PFFs and fluorescent fibrils in- 
cluding Alexa Fluor594 (AF594)-conjugated 
aSyn were provided by K. Luk at the Penn 
CNDER. All fibrils were kept at —-80°C until use. 
After the stock of 5 mg/mL was diluted to a 
working concentration of 0.1 mg/mL in dPBS, 
fibrils were sonicated (Diagenode Biorupter 
Plus) on high, sonicating for 30 s followed by 
30 s of rest for a total of 10 min. Sonicated PFFs 
were then added to classic neuronal media ac- 
cording to appropriate concentrations. 

For AF594-PFF internalization experiments 
using iPSC-Ns, 10 ug of PFFs were added per 
well of a 12-well plate. iPSC-Ns were incubated 
in 4°C for 30 min, followed by an additional 
30-min incubation at 4°C for the negative con- 
trol or 1 hour and 30 min in 37°C for experi- 
mental wells. All cells were immediately fixed 
and stained afterward. 

To demonstrate aSyn pathology after PFF 
transduction, we replaced regular media of 
iPSC-Ns at day 14: with classical neuronal media 
containing total of 1 ug of human WT PFFs 
per well. iPSC-Ns were fixed and stained after 
14 days (at day 28 of neuronal induction). To 
extract soluble proteins, coverslips were fixed 
with 2% paraformaldehyde and 1% Triton X-100 
in dPBS. 3% bovine serum albumin without 
saponin in dPBS was used as blocking buffer. 

For WT PFF internalization experiments 
using HEK293, cells were plated at 200,000 cells 
per well in PDL-coated 12-mm glass coverslips in 
a 12-well format. 18 hours after plating, the cells 
were transfected with 1 ug of DNA (GPNMB un- 
tagged clone pCMV6-X1L4 or TMEM106B-Flag) 
and Lipofectamine 2000 (Thermo Scientific) 
in serum-free DMEM. At 20 hours posttrans- 
fection, media was replaced with DMEM con- 
taining 1 ug of PFFs per well. The plates were 
incubated for 30 min at 4°C followed by an ad- 
ditional 30-min incubation at 4°C for negative 
control (to inhibit endocytosis) or 1 hour and 
30 min in 37°C for experimental wells. All cells 
were immediately fixed and stained for im- 
munofluorescence afterward. 


Patient samples and genotypes 


Plasma and CSF samples of PD patients and 
NCs were obtained as part of the Penn CNDR 
Neuropathology, Biomarker, and Genetics Bio- 
bank and the Parkinson’s Disease Molecular 
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Integration in Neurological Diagnosis Initia- 
tive (MIND) studies. PD patients had a clinical 
diagnosis of PD made by a movement disor- 
ders specialist at the Parkinson’s Disease and 
Movement Disorder Clinic (PDMDC) at the 
University of Pennsylvania, whereas controls 
had no known neurological disorder. Data were 
stored in the Penn Integrated Neurodegenera- 
tive Disease Database (INDD) (30). These studies 
were approved by the UPenn Institutional 
Review Board (IORG0000029). Informed con- 
sent was obtained at study enrollment. Partic- 
ipant demographics are reported in tables S4 
to S7. Individuals with known GBA/ mutations 
(N370S, E365K, and L444P) were excluded from 
the analysis because GPNMB elevation has been 
reported in patients with Gaucher disease, a 
lysosomal storage disorder caused by homozy- 
gous mutations in GBAI (52, 53). Genotyping of 
SNP rs199347 was performed by Infinium Global 
Screening Assay (Illumina), NeuroX genotyping 
platform, PANDoRA (Sequenom) panel (30), or 
MIND panel based on allele-specific PCR per- 
formed using FlexSix Dynamic Array integrated 
fluidic circuits (Fluidigm) and genotyping using 
BioMark HD system (Fluidigm) (54). 

A subset of PD individuals from the pre- 
viously described University of Pennsylvania 
U19 Cohort (55) had UPDRS-PIII scores for 
disease severity. 


Enzyme-linked immunosorbent assay (ELISA) 


GPNMB protein levels within human plasma 
and CSF samples were measured with ELISA 
kits (R&D systems) according to manufacturer’s 
instructions. CSF and plasma samples were 
diluted by factors of 1 in 2 and 1 in 30 respec- 
tively to obtain optical density measurements 
within the standard range. All samples were 
run in duplicates and absorbance at 450 nm 
was determined by a microplate reader (Berthold 
Technologies, Tristar LB 941). Only duplicate 
samples with a coefficient of variation (CV) of 
<25% were retained for analysis, and the av- 
erage CV across all samples used was 3.3%. 
Moreover, replicate samples assayed by ELISA 
on different days, by different operators, across 
multiple freeze-thaw cycles, demonstrated ex- 
cellent reproducibility (Pearson’s correlation 
coefficient 7 = 0.97). 


Multiple linear regression analyses 


Multiple linear regression was used to deter- 
mine factors that are significant predictors of 
GPNMB concentration in plasma and CSF. 
Age, sex, PD status, and rs19934:7 genotype were 
included as independent variables. A codomi- 
nant genetic model, which considers each allele 
combination (GG, AG, and AA) as a separate 
factor (56), was used to model the effect of 
rs199347. Regression coefficients were calcu- 
lated with GG as the reference. 

The relationship between GPNMB levels 
and disease severity was established in two 
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different PD cohorts. GPNMB values for the 
discovery cohort were generated by ELISA 
(R&D systems), whereas GPNMB values for 
the replication cohort were generated using 
an aptamer-based platform (26). UPDRS-PII 
values were downloaded from the Penn INDD. 


Statistical analyses 


Statistical analyses were performed with either 
PRISM or R. Data were tested for normality 
with a Shapiro-Walk test. Data with more than 
two categorical groups were analyzed by either 
one-way analysis of variance (ANOVA) (followed 
by post hoc Tukey or Dunnett tests) or Kruskal- 
Wallis test (followed by post hoc Dunn’s test) 
depending on the data’s distribution. Data 
with only two groups were analyzed by either 
a Welch’s two-sample ¢ test or Mann-Whitney 
U test depending on the data distribution. 
Correlations were determined using Spearman’s 
rank correlation. Multiple linear regression 
(glm) was used to determine associations be- 
tween two variables adjusting for potentially 
confounding covariates. Outlier determina- 
tion was performed using ROUT method, Q 
(FDR) = 1%. 


Immunohistochemistry 


Formalin-fixed, paraffin-embedded cingulate 
and temporal cortex samples were obtained 
from the Penn CNDR. Patient demographics are 
reported in table S1. 6-um sections were cleared 
in xylenes and a descending EtOH series. En- 
dogenous peroxidases were quenched in 30% 
HO, and 70% MeOH solution for 30 min. 
Slides were microwaved in citric acid Antigen 
Unmasking Solution (Vector Laboratories). 
After cooling, slides were rinsed in TBS-T (0.1 M 
Tris Buffer/0.05% TWEEN) and blocked (TBS-T/ 
2%FBS/3%BSA). Sections were incubated over- 
night at 4°C in the primary antibody (see the 
key resources tables for antibody conditions). 
Once washed with TBS-T, sections were in- 
cubated for 1 hour at room temperature in the 
secondary antibody (see the key resources 
tables for antibody conditions). VECTASTAIN 
ABC Standard (Vector Laboratories) was ap- 
plied for 1 hour at room temperature followed 
by ImmPACT DAB (Vector Laboratories). Sec- 
tions were counterstained with Harris Hema- 
toxylin (Thermo Scientific) for 40 s. Slides 
were dehydrated in an ascending EtOH series 
and xylenes then coverslipped with Cytoseal 
(Thermo Scientific). 


Dephosphorylation and deglycosylation 


For deglycosylation of cell and brain lysates, 
5.0 uL of PNGase F was added to 125 ug of brain 
lysates or 60 ug of cell lysates and incubated 
at 37°C for 30 min. For dephosphorylation 
experiments, 1 uL of Lambda protein phos- 
phatase was added to 125 ug of brain lysates 
or 50 ug of cell lysates and incubated at 30°C 
for 30 min. 
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EVOLUTIONARY ECOLOGY 


Metabolic scaling is the product of 


life-history optimization 


Craig R. White*, Lesley A. Alton, Candice L. Bywater}, Emily J. Lombardi, Dustin J. Marshall 


Organisms use energy to grow and reproduce, so the processes of energy metabolism and biological 
production should be tightly bound. On the basis of this tenet, we developed and tested a new theory 
that predicts the relationships among three fundamental aspects of life: metabolic rate, growth, 
and reproduction. We show that the optimization of these processes yields the observed allometries 
of metazoan life, particularly metabolic scaling. We conclude that metabolism, growth, and 
reproduction are inextricably linked; that together they determine fitness; and, in contrast to 
longstanding dogma, that no single component drives another. Our model predicts that 
anthropogenic change will cause animals to evolve decreased scaling exponents of metabolism, 
increased growth rates, and reduced lifetime reproductive outputs, with worrying consequences for 


the replenishment of future populations. 


etabolism and production (growth and 
reproduction) affect every aspect of 
biology, but attempts to understand 
the relationships between these pro- 
cesses often take very different ap- 
proaches. Broadly speaking, two schools of 
thought are applied to understand the inter- 
play between metabolism and biological pro- 
duction (usually growth): metabolic theory 
and life-history theory. Metabolic theories 
assume that the exchange or transport of wastes 
or nutrients across surfaces and through dis- 
tribution networks constrains patterns of energy 
allocation to tissue synthesis (J-4). Metabolism 
and body size trajectories therefore emerge from 
these theories as a consequence of assumed 
physical and chemical constraints on resource 
acquisition or usage. Life-history theory seeks 
to understand ontogenetic trajectories through 
the lens of evolution. It assumes that total 
resource allocation to survival, growth, and 
reproduction is limited, and thus allocation 
to each life-history trait is constrained by 
trade-offs among traits that have emerged as 
an outcome of evolutionary selection (5, 6). 
Metabolic and life-history theories each 
have their own strengths and weaknesses. A 
strength of metabolic theories is that they seek 
to explore the relationship between metab- 
olism and growth from first principles and 
are explicit in terms of mass and energy. How- 
ever, these strengths can also be a weakness if 
they are applied with a rigid emphasis on the 
role of intrinsic, absolute constraints without 
incorporating the capacity of these constraints 
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to evolve in response to selection. A strength of 
the life-history approach is that it focuses on 
the ultimate driver of biological production, 
evolution. An important weakness of this ap- 
proach is that, for simplicity, overall rates of 
energy expenditure and net allocation to 
production are themselves assumed to be 
evolutionarily inert. It is increasingly clear, 
however, that metabolism itself is evolution- 
arily labile and subject to selection (7). Here, in 
contrast to metabolic and life-history theories, 
we propose that the invocation of constraints 
is unnecessary to explain the ontogenetic tra- 
jectories of metabolism and growth. We explore 
this idea by modeling how the relationships 
among metabolism, growth, and reproduction 
emerge from selection to maximize lifetime 
reproduction. 

Our approach builds on decades of work 
investigating how metabolism (and its rela- 
tionship to body mass), growth, and repro- 
duction have coevolved (8-78). Our approach 
does not assume that life is unconstrained by 
physics and chemistry, but instead assumes 
that there is substantial (and underappreci- 
ated) opportunity for evolutionary optimiza- 
tion within these constraints. For example, 
the evolution of vascular distribution networks 
has allowed organisms to increase in size be- 
yond the physical and chemical constraints im- 
posed by the slow diffusion of respiratory gases 
through tissue. Similarly, the large and complex 
surfaces of lungs, gills, and guts demonstrate 
how evolution has increased the capacity for 
exchange of resources beyond the Euclidian 
geometric constraints imposed by simple sur- 
face area-to-volume relationships. Here, we 
apply that same logic but expand it to con- 
sider how whole-organism metabolism might 
coevolve with the other fundamental compo- 
nents of life: growth and reproduction. 


A model for metabolism, growth, and 
reproduction 

Our model is built upon an energy-expenditure 
budget for an animal in which the total rate 
of energy expenditure (Ey) is equal to the 
sum of the rates of energy allocation to self- 
maintenance (F),), production (Fp), digestion 
(Ep), and activity (E,). Ep is equal to the sum 
of the rates of energy allocation to growth (Eg) 
and reproduction (ER). All energy expenditure 
values are expressed as joules per hour. The 
overhead costs of production continue when 
an animal is postprandial (i.e., no longer spend- 
ing energy on digestion) (19); therefore, for 
an inactive postprandial animal when both 
Ey, and Ep are zero, 


Ey =Ey+ Ep =Em+£o+ FR (1) 


The scaling of rates of energy expenditure 
(metabolic rates) with body mass (m, in 
grams) is usually well described by a power 
function E = agm", where ay, is the scaling 
coefficient and b, is the scaling exponent, which 
is typically <1 for Ey and Fy (7, 13, 20, 21) and =1 
for Ep (22, 23). Following Day and Taylor (24), 
we partitioned total production among growth 
and reproduction, with allocation to growth 
occurring early in life and growth ceasing 
when all of production is allocated to repro- 
duction (e.g., Fig. 1A). Assuming that total 
production scales allometrically (24) with an 
exponent equal to bg, = bz,,, it follows that 
animals allocate a fraction (f) of Ey; to pro- 
duction [f = [Ey - Ey,]/E7)]. Growth rate (in 
grams per hour) is the change in m over time 
(t) (dm/dt) and is dictated by Fg and a term 
that represents the overhead cost of tissue 
biosynthesis and serves to convert from units 
of energy to units of mass (C,,, in joules per 
gram). Before reproductive maturation at size 
Mat; Ex = 0 and all of production is allocated 
to growth. Growth rate is then calculated by 
rearranging Eq. 1 to calculate Eg, convert- 
ing Eg to the mass of tissue synthesized by 
dividing by C,,, and substituting scaling rela- 
tionships for Ey as follows: 


dm/dt = fE~/Cm = faz,m?™/Cy (2) 


After maturation, production is divided among 
growth and reproduction, and growth rate can 
be calculated by rearranging Eq. 1 to calculate 
Eg (i.e., subtracting Ep from f of Ey allocated 
to Ep) and substituting scaling relationships 
for both Fy and ER 


dm/dt = [fEy — Eg]/Cm 
= [faz,m?* —_ ay, m?* | /Cm (3) 


Growth ceases at maximum mass (MV, in 
grams), when all of production is allocated 
to reproduction (i.e., fag,.M°r = ag,M"*s), 
SO dp, can be estimated from f, dz,, bz,, Dr,; 
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ONTOGENETIC TRAJECTORIES OF SIZE (GROWTH), REPRODUCTION, AND METABOLISM: 
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Fig. 1. Division of energy between growth and reproduction yields 
ontogenetic growth trajectories. (A) Size-dependent allocation of energy to 
production (Ep), which is partitioned among growth (E,) and reproduction 
(Er) and is equal to the difference between total metabolic rate (Er, solid line) 
and maintenance metabolic rate (Ey, dashed line). Ep is represented by the 


Predicted 


Growth performance 


Annual reproduction 


Metabolic level 


Scaling exponent 


Fig. 2. Life history covaries with metabolic scaling. Predicted and observed 
effects of metabolic level and the metabolic scaling exponent on growth 
performance (A to D) and annual reproduction (E to H). Metabolic level (a, in 


milliliters of oxygen per hour) and the metabolic scaling exponent (b) are 
b 


estimated from the scaling of metabolic rate (E) with mass (m) where E = am”. 


Dashed lines in (A), (B), (E), and (F) depict the directionality of predictions 
derived from the model described in the text (see fig. S1 for further details). 
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blue shaded area bounded by the dotted and dashed lines, and Eg is represented 
by the red shaded area. (B to D) Example fits of the growth model to 
mass-for-age data for Atlantic horse mackerel (B), leopard geckos (C), and 
gray field slugs (D) (25). Error bars in (B) are standard errors for the mean mass 
of animals of identical age. 
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Solid lines in (C), (D), (G), and (H) are empirical relationships estimated from 
phylogenetic generalized least-squares models including mass and temperature 
as covariates (tables S1 and S2). Points are shown adjusted for the effects 

of all other predictors in the model. Phylogenetic heritability (A) was significantly 
greater than zero, and the effect of metabolic level and the metabolic 

scaling exponent was significant for all models (P values are shown in the 
relevant panel, along with sample size, n). 
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Fig. 3. Allometric scaling of metabolic rate max- 
imizes lifetime reproduction. (A) Predicted effects 
of metabolic level (a¢,, in arbitrary units) and the 
metabolic scaling exponent (be,) on lifetime 
eproduction (scaled from O to 1), where metabolic 
ate scales with mass (m) as ae,m’e, (B to 

E) Sensitivity of the optimal value of be, (the value 
that maximizes lifetime reproduction) to variation 
in key parameters describing growth and repro- 
duction over the life of a hypothetical organism, 
including: f, the fraction of total metabolism that is 
allocated to production (growth and reproduction) 
(B); C;, a constant linking ae, with life span 

(C); be,, the scaling exponent of reproductive 
output (D); and Mmat/M, the ratio of mass at 
maturity to maximum mass (E). Shaded areas in 
(B) to (E) represent percentile ranges of the 
empirical distribution of be, (Fig. 4), with the darkest 
shade bounding the 10th and 90th percentiles, 

the intermediate shade bounding the 5th and 

95th percentiles, and the lightest shade bounding 
the 2.5th and 97.5th percentiles. The dashed line in 
(D) is the line of unity where be, = be,. 


and M by rearrangement. By substituting 
dz, = faz,M°™ /M°. into Eq. 3 and rear- 
ranging, growth rate can be estimated as 
follows: 


am = Fas, In — Mr (=) "] (4) 


The integral of Eq. 4 yields a growth trajectory 
but has no closed-form solution and so must 
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be solved by numerical approximation (e.g., 
Fig. 1, B to D). Our approach is philosophically 
similar to that of Day and Taylor (24) (e.g., 
their equation 10) but differs in that we do 
not fix the scaling exponent of production. 
Instead, we assume that production is propor- 
tional to metabolic rate and set the scaling 
exponent of production equal to the scaling 
exponent of metabolic rate. We formulated 
Eq. 4 using a scaling relationship for repro- 
duction so that it can be easily parameterized 
using empirical data (e.g., 22). We note that 
although the model is formulated for animals 
that exhibit indeterminate growth, it can also 
accommodate determinate growth. Determi- 
nate growth occurs if animals allocate all of 
production to reproduction immediately upon 
reaching maturity (in model terms, determi- 
nate growth can arise if the scaling exponents 
of metabolism and reproduction are equal, or 
if size at maturity is equal to M). 


Conditions that optimize life history 


To explore the relationships among metabolic 
scaling parameters (az, and b;,), growth, and 
reproduction statistically, it is necessary to 
derive an estimate of az, that is not correlated 
with bz. To do so, we define metabolic level 
(L) as the metabolic rate estimated at a com- 
mon mass (7') that results in a correlation 
of 0 between logioL and b (L = ag,m’™) (25). 
Defined in this way, L represents a mass- 
independent estimate of the elevation of the 
metabolic scaling relationship (i.e., animals 
with a high LZ have a high Ey at m’), and bz, 
represents the slope of the relationship be- 
tween log,,)Fr and log; 9m. If the assumptions 
stated above hold, then Eq. 4 makes four 
testable predictions, which are illustrated in 
Fig. 2 and further elaborated in fig. S1: (i) For 
a given metabolic scaling exponent b;,, growth 
will be positively correlated with metabolic 
level L (Fig. 2A); i.e., animals with a higher 
metabolic level grow faster [e.g., (17, 26, 27, 28)]. 
Gi) For a given L, growth will be negatively 
correlated with bz, (Fig. 2B); i.e., animals with 
shallower metabolic scaling relationships allo- 
cate more energy to production early in life 
and therefore grow faster. (iii) For a given b;,, 
maximum energy allocation to reproduction, 
Er». (=ER when m = M), will be positively 
correlated with L (Fig. 2E); i.e., animals with a 
higher metabolic level allocate more energy to 
reproduction [e.g., (29, 30)]. (iv) For a given L, 
ER, Will be positively correlated with bg, (Fig. 
2F); ie., animals that have steeper metabolic 
scaling relationships allocate more energy to 
production later in life and therefore have 
higher maximum reproduction. 

To test these four predictions, we compiled 
intraspecific metabolic scaling and life-history 
data from the literature and analyzed them in 
a phylogenetic generalized least-squares frame- 
work (25). The dataset includes new compila- 


tions of ontogenetic scaling relationships ex- 
pressed as a function of live mass or wet mass, 
growth trajectories, annual reproduction, and 
longevity (32). We restricted our compilation 
of growth data to animals that grow after 
maturation and show growth patterns that 
are reasonably well approximated by the von 
Bertalanffy growth equation. We calculated 
growth performance, a size-independent esti- 
mate of growth rate, as Pauly’s (2) growth per- 
formance index (@) from estimates of maximum 
length (/..) and the growth constant (#) derived 
from fits of von Bertalanffy growth curves to 
length-at-age data using the equation 6 = 
logiok + 2logiol... Life histories for each species 
were characterized by mean growth perform- 
ance, mean annual reproduction, and maxi- 
mum longevity. 

The patterns in the data match all four 
predictions from Eq. 4: Life histories have 
been optimized such that growth performance 
and reproduction both increase with meta- 
bolic level (predictions 1 and 3, Fig. 2, C and 
G), growth performance decreases with the 
metabolic scaling exponent (prediction 2, Fig. 
2D), and reproduction increases with the meta- 
bolic scaling exponent (prediction 4, Fig. 2H). 
The data also show that longevity decreases 
with metabolic level (fig. S2A), in keeping with 
the predictions of both metabolic theory (J, 10) 
and life-history theory (11, 32). Longevity is 
not related to the metabolic scaling exponent 
(fig. S2B). 


Life-history optimization yields allometric 
scaling of metabolism 


After validating the predictions of our growth 
model, we next explored the effects of meta- 
bolic level and the metabolic scaling expo- 
nent to determine whether any combination 
of these parameters maximizes lifetime repro- 
duction. To do this, we used Eqs. 3 and 4 to 
generate growth trajectories and estimated life- 
time reproduction from the time dependence 
of m and the scaling of Ep with m for values of 
m greater than Myat 


Ex = [faz,M?* /M”* | me (5) 


We used a numerical model to estimate growth 
and reproduction through 1 million time steps 
from birth to maximum longevity (25). As a first 
approximation, we modeled animals growing 
from a starting mass of m = 1 to M = 100 and 
assumed that bz, = 1.137 (22, 23) and f = 0.43 
(33). This estimate of f derived from growth 
data is broadly consistent with estimates of 
the energetic cost of egg biomass production 
in female animals [~50% of basal metabolic 
rate (34)]. The cost of gamete production is 
lower for males than females (34), but costs 
associated with mating effort can be higher 
[e.g., (35)], and the overall costs associated 
with reproduction can be similar for males 
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Fig. 4. Metabolic rate scales allometrically with body mass. (A) Funnel plot showing the relationship 
between the intraspecific scaling exponent of metabolic rate and mass range (orders of magnitude) 
over which the scaling exponent was estimated for scaling exponents estimated from the relationship 
between metabolic rate and live or wet body mass. The horizontal solid line is the mean of all 

values (0.743 + 0.017) estimated in an intercept-only linear mixed model with random intercept terms 
for species identity and taxonomic group. Dashed lines are the 95% profile likelihood confidence 
interval (95% Cl) for the mean (0.699, 0.777). (B) Mean observed scaling exponent (shown + 95% Cl) 
from the empirical distribution in (A) compared with a scaling exponent predicted to maximize 
eproduction (Fig. 3A) based on a set of assumptions regarding energy allocation to production, the 
size dependence of reproduction, size at maturity, and longevity, as described in the text. Plastic or 
evolutionary changes in energy allocation to production, the size dependence of reproduction, size at 
maturity, or longevity will yield changes in the scaling exponent predicted to maximize lifetime 
eproduction (Fig. 3, B to E). The error bars associated with the predicted scaling exponent show 

the range of values over which lifetime reproduction is >99% of maximum. 
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Fig. 5. Scaling of metabolic rate (MR) with body mass. Black lines depict intraspecific scaling 
relationships for endotherms (birds and mammals). Colored lines depict intraspecific scaling relationships 
for ectotherms (fishes, amphibians, reptiles, and invertebrates), which are colored by measurement 
temperature from 1.8° to 40°C. 


SCIENCE science.org 


and females [e.g., (36)]. We assumed that 
Mynat = 0.223 M on the basis of data for fish 
showing that the mean length at maturity is 
equal to 0.61 of maximum length (37) and as- 
suming that, on average, mass is proportional 
to length cubed. Finally, we assumed that life 
span is proportional to a constant (C;) divided 
by az, and first report the results of these 
analyses with C; set to an arbitrary value of 
120. The value of C; = 120 was selected because, 
in combination with the other parameters 
drawn from data (bz, = 1.137, f = 0.43, Mmat = 
0.223 M), this value yields an optimal value 
of the metabolic scaling exponent (0;,.) that 
is broadly consistent with the mean value ob- 
served in the data (Figs. 3 and 4). We then used 
sensitivity analyses to explore how variations 
in the values of bz,, f Mmat, and C; alter the 
optimal value of bz,. 

The model predicts that lifetime reproduc- 
tion varies with the metabolic scaling exponent 
bz, but not with the metabolic level L. Lifetime 
reproduction is independent of metabolic level 
over a wide range (Fig. 3A) that far exceeds the 
approximately three orders of magnitude range 
of metabolic level observed at any given body 
mass in the dataset (Fig. 5). Other factors that 
are not included in our model probably set the 
upper and lower limits of metabolism at any 
given size. Such limits may be imposed by 
resource availability in the environment medi- 
ated by minimum viable population sizes (38) 
or by constraints on the capacity of organisms 
to ingest and process resources or expend 
energy (39). 

A key outcome of our study is that the 
predicted metabolic scaling exponent that 
maximizes lifetime reproduction is almost 
always allometric and falls within the range 
of observed data (Fig. 3; 90% of the observed 
scaling exponents are <1; Fig. 4A). Our initial 
parameter estimates for bz,, f, and Mmat Were 
drawn from published data, but the value of 
C; was selected to yield an estimate of the 
optimal scaling exponent of metabolic rate bz, 
that closely matches the mean scaling expo- 
nent of metabolic rate from the literature com- 
pilation, which is ~3/4 (Fig. 4). However, the 
model does not predict a universally optimal 
bz,, but instead predicts that the value of bz, 
that maximizes lifetime reproduction depends 
on the values of f(Fig. 3B), C; (Fig. 3C), bz, (Fig. 
3D), and Mmat (Fig. 3E). The model therefore 
predicts that evolutionary or plastic changes 
in energy allocation to production, the size 
dependence of reproduction, the size at matu- 
rity, or the relationship between longevity 
and metabolism (and thereby production) 
will yield changes in the scaling of meta- 
bolic rate. This is consistent with empirical 
studies showing that the scaling exponent 
of metabolic rate varies in response to a 
wide range of biotic and abiotic variables 
[e.g., (12, 13, 20, 21)]. 
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Our finding that allometric metabolic 
scaling emerges from the optimization of 
growth and reproduction within a finite 
life span to maximize lifetime reproduction 
offers a potential solution to one of the most 
enduring controversies in biology (20, 21). 
Metabolic theories have usually proposed ex- 
planations for the allometric scaling of meta- 
bolic rate on the basis of the assumption that 
allometry arises as a consequence of physical 
geometric constraints, such as those that are 
imposed by the geometry of resource distri- 
bution networks (3), the need to dissipate 
heat produced as a by-product of metabolism 
(18, 39, 40), and surface area-to-volume con- 
straints on the fluxes of nutrients or wastes 
(7, 18). Others have countered that variation 
in metabolic allometry arises as a conse- 
quence of selection rather than constraint 
[e.g., (7, 9, 12, 15, 18, 21, 41-43)]. Our model 
and data support the latter view, that allo- 
metric scaling of metabolic rate is predicted 
to arise if selection optimizes growth and 
reproduction to maximize lifetime reproduc- 
tion (Fig. 4). Metabolic allometry can there- 
fore be explained without the need to invoke 
any of the assumed constraints tradition- 
ally imposed by metabolic theories [e.g., 
(1, 4, 18, 39, 40)]. 

Allometries are characteristic of life and are 
observed at the level of enzymes, mitochon- 
dria, cells, whole organisms, populations, and 
ecosystems (44-48). Kozlowski and colleagues 
have shown how intraspecific allometries, such 
as those predicted by our model, yield among- 
species metabolic scaling patterns that are also 
allometric (9, 41, 42). Thus, taken together, 
the work of Koztowski and colleagues and our 
study offer a mathematically explicit expla- 
nation for biological allometries at multiple 
scales of organization. Intraspecific metabolic 
allometries arise as a consequence of evolu- 
tionary optimization of growth and repro- 
duction (Fig. 3), and interspecific metabolic 
allometries arise as a consequence of evolu- 
tionary optimization of body size and meta- 
bolic rate [e.g., (9, 41, 42)]. We therefore suggest 
that the allometric scaling of metabolic rate 
arises not because it is inevitable but rather 
because it is advantageous. 


Implications for life histories in 
the Anthropocene 


Elevated extrinsic mortality is a signature of 
the Anthropocene (49). The framework that we 
provide here not only offers new insight into 
the origin of biological allometries but also 
gives us the opportunity to explore and predict 
how metabolism, growth, and reproduction 
will respond to future changes in the envi- 
ronment (25). Our model predicts that organ- 
isms living in a future environment with a 10% 
increase in mortality (an arbitrary but realistic 
figure) are predicted to have substantially re- 
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duced lifetime reproductive output (-36%) and 
to evolve lower metabolic scaling exponents 
(decreased from 0.76 to 0.73) that are associated 
with more rapid growth and earlier matura- 
tion (-1.5%). The substantial decrease in re- 
production predicted by the model, and the 
consequences of this for population replen- 
ishment, is particularly concerning. 


Conclusion 


Our findings suggest that metabolism, growth, 
and reproduction have coevolved to maximize 
fitness (i.e., lifetime reproduction) and that 
the observed patterns in these fundamental 
characteristics of life can be explained by op- 
timization rather than constraint. We offer 
this as an alternative way of viewing the origin 
of biological allometries at all scales, recogniz- 
ing that it represents a profound departure 
from classic thinking. Our viewpoint is analo- 
gous to the concept of the Hutchinsonian 
niche (50): Constraints define the range within 
which life is possible (analogous to the funda- 
mental niche), whereas optimization yields 
the (smaller) range of possibilities exploited 
by life (analogous to the realized niche). We 
reiterate that this viewpoint does not deny 
the existence of biophysical constraints and 
does not call forth Darwinian demons (hypo- 
thetical organisms that maximize all aspects of 
fitness simultaneously and would exist if there 
were no constraints on evolution). Rather, our 
approach expands the phenotypic space in 
which evolutionary optimization operates 
and avoids giving primacy of causation to 
any single pillar of multicellular life. It also 
emphasizes that the pillars of metabolism, 
growth, and reproduction have coevolved 
to shape each other, and, consequently, ob- 
served life-history strategies emerge from 
the optimization of these to maximize life- 
time reproduction within a finite life span 
(8, 9, 15, 41, 42). Within this multivariate 
optimization dwells the great diversity of life 
histories in nature. 
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DEFLUORINATION 


Low-temperature mineralization of 


perfluorocarboxylic acids 


Brittany Trang’, Yuli Li?*+, Xiao-Song Xue*, Mohamed Ateia't, K. N. Houk**, William R. Dichtel’* 


Per- and polyfluoroalkyl substances (PFAS) are persistent, bioaccumulative pollutants found in water 
resources at concentrations harmful to human health. Whereas current PFAS destruction strategies use 
nonselective destruction mechanisms, we found that perfluoroalkyl carboxylic acids (PFCAs) could be 
mineralized through a sodium hydroxide—mediated defluorination pathway. PFCA decarboxylation in 
polar aprotic solvents produced reactive perfluoroalkyl ion intermediates that degraded to fluoride ions 
(78 to ~100%) within 24 hours. The carbon-containing intermediates and products were inconsistent 
with oft-proposed one-carbon-chain shortening mechanisms, and we instead computationally 

identified pathways consistent with many experiments. Degradation was also observed for branched 
perfluoroalkyl ether carboxylic acids and might be extended to degrade other PFAS classes as methods 


to activate their polar headgroups are identified. 


er- and polyfluoroalkyl substances (PFAS) 

are anthropogenic substances containing 

multiple C-F bonds. PFAS are used as 

omniphobic surfactants in many indus- 

trial processes and products, including 
in poly(tetrafluoroethylene) production; as 
water-, oil-, and stain-resistant barriers for 
fabrics and food service containers; and as 
components of aqueous film-forming foams 
for fire suppression (7). As a result of their 
widespread global use, environmental persist- 
ence, and bioaccumulation, PFAS contami- 
nation is pervasive (2) and affects drinking 
water, surface waters, livestock, and agricultural 
products around the world (3). This persistent 
environmental contamination is alarming be- 
cause chronic exposure to even low levels of 
these compounds is associated with negative 
health effects such as thyroid disease, liver 
damage, high cholesterol, reduced immune 
responses, low birth weights, and several can- 
cers (4). Many of these effects have been ob- 
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scured by PFAS manufacturers for decades (5). 
The growing focus on removing parts-per- 
billion to parts-per-trillion levels of PEAS con- 
tamination from drinking water supplies has 
produced several PFAS-removal approaches, 
including established adsorbents such as acti- 
vated carbon and ion-exchange resins, as well 
as emerging materials such as cross-linked 
polymers (6, 7). Adsorbents or membrane-based 
separation processes create PFAS-contaminated 
solid or liquid waste streams but do not address 
how to degrade these persistent pollutants. 
PFAS destruction is a daunting task because 
the strong C-F bonds that give PFAS their 
desirable properties also make these com- 
pounds resistant to end-of-life degradation. 
Harsh PFAS degradation methods include 
incineration (8), ultrasonication (9, 0), plasma- 
based oxidation (11), electrochemical degrada- 
tion (72, 13), supercritical water oxidation (74), 
ultraviolet-initiated degradation using addi- 
tives such as sulfite or iron (15-19), and other 
combinations of chemical and energy inputs 
(20) (table S1). Leveraging the reactivity of 
perfluoroalkyl species might, however, offer 
milder alternatives to address the PFAS con- 
tamination problem. 

The opportunity to degrade PFAS at high 
concentrations in nonaqueous solvents has re- 
cently been developed using PFAS adsorbents 
that can be regenerated using a simple solvent 
wash. This development enables the destruc- 
tion of these compounds after they have been 
removed from water resources, which broadens 
suitable degradation conditions beyond dilute 


aqueous environments. Here, we accessed re- 
active perfluoroalkyl anions that are mineral- 
ized under mild conditions by decarboxylating 
perfluorocarboxylic acids (PFCAs), one of the 
largest classes of PFAS compounds, at low 
temperatures in dipolar aprotic solvents (Fig. 1). 
PFCAs of various chain lengths undergo effi- 
cient mineralization in the presence of NaOH 
in mixtures of water and dimethyl sulfoxide 
(DMSO) at mild temperatures (80 to 120°C) 
and ambient pressure. Under these conditions, 
perfluorooctanoic acid (PFOA, 1) is completely 
degraded with >90% defluorination and mini- 
mal formation of fluorocarbon by-products. 
Experimental observations and density func- 
tional theory (DFT) calculations offer strong 
evidence for degradation pathways distinct 
from the single-carbon-chain shortening pro- 
cesses proposed in prior PFAS degradation 
studies (1, 16, 18, 21-23). This reactivity mode 
is immediately promising for PFCA destruc- 
tion and may prove generalizable to other 
PFAS classes as methods to activate their polar 
groups are identified. 


Decarboxylation and defluorination of PFCAs 
in polar aprotic solvent 


Perfluoroalkylcarbanions are easily accessed by 
decarboxylating PFCAs in dipolar aprotic sol- 
vents. In a solution of DMSO and H,O (8:1 v/v) 
at 120°C, PFOA decarboxylates to form perfluoro- 
1H-heptane 2, which phase separates from 
solution as an oil. ‘H, “C, and °F nuclear mag- 
netic resonance (NMR) spectroscopy of the 
isolated oil confirmed the formation of the 
decarboxylated product in high purity (figs. S1 
to S4). This decarboxylation reaction is consis- 
tent with those reported by Kong et al., who 
found that most carboxylic acids decarboxylate 
reversibly in dimethylformamide (24). Zhou et al. 
(25) studied the origins of this reversible car- 
boxylation computationally and determined 
that the lower barrier to decarboxylation was 
fully induced by solvent effects from the polar 
aprotic solvent (figs. $45 and S58). Such re- 
activity has also been observed as a complica- 
tion for analytical standards (26, 27). We found 
that when the same PFOA solution in DMSO/ 
H,O was subjected to the decarboxylation 
conditions but in the presence of NaOH 
(30 equiv), PFOA instead degraded to a mix- 
ture of fluoride, trifluoroacetate ions, and carbon- 
containing by-products (Fig. 2A). Degradation 
also occurred in other polar aprotic solvents 
such as dimethylacetamide and sulfolane but 


19 AUGUST 2022 * VOL 377ISSUE 6608 839 


RESEARCH | RESEARCH ARTICLES 


RFRFRF O RFRF F 
Ba in Raa PP>SSchy? 
FOF FF FFF FFFFF F 
1, PFOA RFR FRE 
rn Xe identified computationally 
cate | F er RE FFF Authentic standard degrades 
reactive perfluoroanion 
RFRFRE 1 
F H NaOH, DMSO/H,0 
F 40°C formate 
F FFF FFF F  tworide C carvonate 
2 98 CF,CO; 10748 gueolete 
Isolated “a. . ae pea 


Authentic standard degrades 


Fig. 1. Overview of degradation pathways identified in this study. Heating PFCAs in polar aprotic 
solvents such as DMSO decarboxylates them to 1H-perfluoroalkanes. When this reaction was performed 
in the presence of NaOH, the PFCA mineralized to fluoride, sodium trifluoroacetate, and nonfluorinated 
carbon-containing products. The 1H-perfluoroalkane underwent the same degradation process at even lower 
temperatures. Computational studies identified the corresponding perfluoroalkenes as likely intermediates, 
and an authentic standard of the seven-carbon perfluoroalkene was competent for the degradation. 


did not proceed in pure water (fig. S20 and 
table S3). ‘°F NMR spectroscopy of reaction 
aliquots collected over 24 hours indicated that 
resonances corresponding to PFOA were no 
longer detectable within 14 hours. Unexpect- 
edly, no resonances corresponding to perfluoro- 
alkyl groups containing between four and 
seven carbons were observed. Resonances cor- 
responding to sodium perfluoropropionate 
(CF3CF,CO,Na) at -81.5 and -118.2 ppm were 
observed just above the baseline within spectra 
of aliquots collected at reaction times shorter 
than 24 hours but were absent in spectra of 
later aliquots (fig. S10). The only prominent 
fluorine resonance in the aliquot sampled at 
24 hours corresponds to sodium trifluoroa- 
cetate (CF;CO.Na, -73.6 ppm; Fig. 2B). Integra- 
tion of this resonance indicated that its intensity 
plateaued at ~4 to 24 hours, corresponding 
to only 7% of the F content and 9% of the C 
content relative to the initial PFOA concentration 
(Fig. 2, A and C). The resonance from CF;CO,Na 
ions eventually decreased in intensity and pre- 
sumably degraded into fluoride, albeit much 
more slowly than the rate of PFOA disap- 
pearance (Fig. 2C, inset). This resonance dis- 
appeared over 300 hours, which we confirmed 
by subjecting an authentic sample of sodium 
trifluoroacetate to the same reaction conditions 
(fig. S24). PFOA degradation is thus rapid and 
forms CF;CO,Na and trace CF;CF,CO.Na as 
the only identifiable perfluoroalkyl-containing 
liquid-phase by-products, each of which contin- 
ues to degrade over extended reaction times. 
Subjecting perfluorooctane sulfonate ions to 
the basic decarboxylation conditions did not 
result in decreasing perfluoroalkyl °F NMR 
integrations or fluoride formation (fig. S19 
and table S3), indicating that decarboxylation 
to the reactive anion intermediate is the key first 
step of the defluorination process for PFCAs. 
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Ion chromatography (IC) indicated that 
90 + 6% of the fluorine atoms originating from 
the PFOA were recovered as fluoride ions after 
24 hours of reaction at 120°C (fig. S29). Con- 
trol experiments showed that the fluorinated 
polytetrafluoroethylene reaction vessels did not 
contribute an appreciable amount of fluoride 
to fluoride recovery (table S3). Fluoride analy- 
ses performed by IC at shorter reaction times 
indicated that fluoride increased proportion- 
ally to the decrease in [PFOA] observed by F 
NMR spectroscopy. This high fluoride recov- 
ery indicates that most of the perfluoroalkyl 
fluorines were defluorinated and mineralized 
rather than being transformed to smaller-chain 
PFAS or being lost as volatile fluorocarbons. 


Degradation of varied PFAS and by-product 
analysis suggest a complex mechanism 


PFCAs with different chain lengths (two to nine 
carbons) were degraded, providing fluoride 
recoveries between 78% and quantitative at 
24 hours for all PFCAs with four or more 
carbons (Fig. 2D). Although the longer-chain 
(C = 4) PFCAs had a degradation profile 
similar to that of PFOA in that their perfluoro- 
alkyl peaks disappeared from the ‘°F NMR 
spectra (fig. S22) and CF;CO, was formed (Fig. 
2D and fig. S23), the destruction of shorter- 
chain PFCAs (C = 2, 3) was slower and appeared 
to occur by different mechanisms. For trifluoro- 
acetate (C = 2), degradation is slow (>6 days; 
fig. S24), likely because the instability of the 
CF; anion (28) hinders decarboxylation, such 
that destruction occurs either more slowly or 
by a different mechanism. The carbanion cor- 
responding to perfluoropropionic acid (PFPrA) 
(C = 3) decarboxylation is similarly unstable 
(28), resulting in degradation faster than tri- 
fluoroacetate but slower than the longer PFCAs 
(fig. S22). Although the PFPrA F NMR peaks 


disappeared completely over 3 days, fluoride 
recovery was lower than in other PFCAs (3.9 + 
1.6%; Fig. 2D). PFPrA, unlike others in the series, 
decarboxylates to form a volatile product; in 
the ‘°F NMR for PFPrA degradation, peaks 
corresponding to CF;CF,H can be identified (figs. 
S11 and S12). Headspace gas chromatography- 
electron-impact mass spectrometry (MS) also 
detected the CF,CF,** fragment in the gas 
phase of the reaction (fig. S40). This finding 
was corroborated by atmospheric pressure 
chemical ionization-MS of a liquid aliquot 
of the reaction that had a prominent peak 
corresponding to CF3;CF, (compare fig. S39 
with figs. S38 and S40). It appears to be more 
favorable to produce volatile CF;CF,H than 
for the C = 3 PFCA to proceed down the de- 
struction pathway; as discussed below, this 
supports our proposal that a y-carbon is nec- 
essary for the major defluorination pathway 
to occur. Previous PFAS degradation studies 
have suggested that PFCAs (or other PFAS 
that are PFCA precursors) degrade through a 
decarboxylation-hydroxylation-elimination- 
hydrolysis (DHEH) pathway in which each 
PFCA is shortened by one carbon each 
cycle, producing successively shorter PFCAs 
(11, 16, 18, 21-23). However, the nonconformal 
degradation of the three-carbon acid and the 
products observed in the °F NMR spectra of 
degradation reactions of PFCAs containing 
four or more carbons in the present study in- 
dicated that degradation instead occurs through 
distinct, non-single-carbon shortening mecha- 
nisms under these conditions. 

The hypothesis that degradation does not 
occur by iterative one-carbon shortening was 
further supported by quantifying the carbon- 
containing by-products formed when PFOA 
was degraded for 24 hours. We examined a 
combination of solution *H and F NMR 
spectroscopy and quantitative *C NMR spec- 
troscopy of the precipitate isolated from the 
reaction and dissolved in D,O. We also per- 
formed ion chromatography on the combined 
solution and precipitate by adding water to 
the reaction mixture until the precipitate redis- 
solved. These measurements accounted for the 
complete carbon balance of the PFOA degra- 
dation (107 + 8 mol% C relative to the [PFOA] ; 
table S4 and fig. S30). Other than the residual 
CF;CO, ions described above, which contin- 
ued to degrade at longer reaction times, no 
other organofluorine compounds were detected. 
Instead, one-, two-, and three-carbon products 
lacking C-F bonds were identified and quan- 
tified. Formate ions were found in solution 
(fig. S9) and in the precipitate, corresponding 
to 2.5 + 0.3 mols formate ions/mol PFOA, as 
determined by combining the formate con- 
centrations measured in the solution and pre- 
cipitate by NMR spectroscopy. This amount 
is consistent with ion chromatography of the 
reaction mixture and redissolved precipitate, 
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Fig. 2. Overall reaction scheme, experiments 
monitoring PFOA and CF3CO,° concentrations 
over the course of the reaction, and summary of 
degradation products from a series of PFCAs 

of different lengths. (A) Heating 0.089 M PFOA in 
8:1 DMSO:H20 with 30 equiv NaOH allowed 90% of 
the initial fluorine to be recovered as inorganic 
fluoride and residual trifluoroacetate with few other 
organofluorine by-products. Formate ions (26 mol %) 
and several other nonfluorinated by-products were 
identified (107 + 8 mol %). (B) °F NMR spectra from 
0 to 24 hours. Peaks corresponding to PFOA 
perfluoroalky! fluorines between -115 and -126 ppm, 
as well as at -80 ppm, disappeared in less than 

24 hours. Trifluoroacetate (-73.6 ppm) appeared 
and disappeared (disappearance shown in inset of 
panel C) more slowly over the course of the 
reaction. (©) Amount of PFOA (purple, solid line) 
and sodium trifluoroacetate (gray, dashed line) 

in the reaction over time. Error bars correspond 
to the standard deviation of three experiments. 
(D) Fluoride recovery was calculated as 

mols fluoride after reaction as detected by ion 
chromatography per mol fluorine in PFCA reactant. 
Formate/PFCA was calculated as mols formate 
as detected by IC after reaction per mol 

PFCA reactant. CF3CO2 /PFCA was determined 
as mols CF3CO.~ as calculated from '°F NMR 
spectroscopy after 24 hours of reaction per mol 
PFCA reactant. All measurements are expressed 
as the average of three trials unless specified 
otherwise, and error is expressed as a standard 
deviation. All reaction times are 24 hours 

unless specified otherwise. “a,” 286 hours, 

single measurement; “b,” 63% + 12% of 

PFPrA starting material degraded after 24 hours. 
(E) Structures of the identified carbon-containing 
by-products. 


which provided 2.1 + 0.2 mols formate/mol 
PFOA. Formate formation and the varying 
amounts of formate produced by PFCAs of 
other chain lengths inspired a deeper mech- 
anistic study (see below). Carbonate ions were 
detected exclusively in the precipitate, corres- 
ponding to 2.1 + 0.3 mols/mol PFOA. The most 
likely source of carbonate ions was from the 
initial decarboxylation step, along with other 
downstream processes that generate carbon 
dioxide or single-carbon products at the same 
oxidation state. Two-carbon products, glyco- 
late ions (0.6 + 0.1 mol/mol PFOA) and oxalate 
ions (0.7 + 0.1 mol/mol PFOA), were found 
in the precipitate, along with three-carbon- 
containing tartronate ions (0.2 + 0.1 mol/mol 
PFOA). The glycolate and oxalate ions were 
identified by C NMR spectroscopy and com- 
parison with authentic standards. Tartronate 
ions were identified by a combination of “C 
and 'H NMR spectroscopy, which were con- 
sistent with literature reports (29) and showed 


SCIENCE science.org 


A 8:1 DMSO:H,0 formate oxalate tartronate 
RFRFREF O NaOH (30 equiv) 31% 18% 4% 
F. qo carbonate glycolate unknown | 10728 
F FFF FF 120°C 26% 15% 5% molt. 
24h CF,CO, 
0.089 M 8% 
Cc 
= v0 
e so 
| th | | ate Z 
&§ 60 
= 
Md 0 4 © 120 0 XO 360 289 RO MO 
4h = 40 Time (h) 
A Me ON eR lt 
° Fer Fer Fer O S RR 
24h Fi Fop Fel F oO 
0 
oe een Seen eee ered, gene aan aupenn sean mane ———j—- 
-74 -76 -78 80-114 -118 -122 | -126 ‘ ct 2 wa Ss 
3 (ppm) Time (h) 
# of _. Fluoride Recovery Formate/PFCA CF,CO,/PFCA 
Carbons “PPrevistion (%) (mov/mol) (mol/mol) 
2 TFA 45.95 0.98 - 
> PFPrA 3.9416 0.1540.18 not observed 
4 PFBA 103.8 +9.5 0.3120.14 not observed 
5 PFPeA 92.64 5.2 0.24 40.18 0.30 
6 PFHxA 78.0 +45 0.85+0.19 0.19 
7 PFHpA 80.549.1 1.02 + 0.08 0.26 
8 PFOA 90.1+5.8 2.07 + 0.22 0.36 
9 PFNA 88.740.9 2.51 40.18 0.39 
AF 9° 
M ron 
Fe OFF cr, 41.0449 0.46 + 0.03 not observed 
Genx 
. RFR eo 
F INH 80.8 +3.3 0.89 + 0.01 0.26 
F FFF F cr, 


the expected correlations in two-dimensional 
NMR experiments (figs. $31 and S32). Finally, 
a small amount of the PFOA carbon balance 
was found in an unknown product, which 
we designated as a secondary degradation 
product derived from the reaction of glycolate 
ions with other intermediates because it was 
formed in greater amounts when glycolic acid 
was included at the beginning of the PFOA 
degradation reaction. Identifying and quan- 
tifying these carbon products has important 
implications for PFOA degradation. First, the 
high recovery of products with no C-F bonds, 
along with the high fluoride ion recovery, 
confirms that these conditions efficiently 
mineralize PFCAs. Furthermore, identifying 
multiple two- and three-carbon by-products 
further implicates mechanisms more compli- 
cated than iterative one-carbon shortening 
processes. 

PFCAs of different lengths degraded by dif- 
ferent pathways, as indicated by the distinct 


patterns in their formate and CF;CO, forma- 
tion. If the chain-shortening DHEH mecha- 
nism were operative, then we would expect 
that resonances belonging to chain-shortened 
species would appear transiently in the “F NMR 
spectra as longer-chain PFCAs speciated into 
a distribution of shorter-chain PFCAs. Instead, 
only °F NMR peaks corresponding to CF;CO.” 
and trace amounts of CF;CF,CO. were de- 
tected, and the following by-product patterns 
emerged. PFCAs containing four or fewer carbons 
did not produce any CF3;CO, , but all PFCAs 
containing more than four carbons produced 
roughly the same substoichiometric amount of 
CF3CO, : ~0.3 equivalents of CF;CO, /mol PFCA. 
PFCAs containing fewer than six carbons did not 
produce substantial amounts of formate (Fig. 2D), 
but PFCAs containing six or more carbons 
produced increasing amounts of formate, with 
C= 6 and 7 producing ~1 equivalent of formate 
per PFCA, C = 8 ~2 equivalents, and C = 9 ~2.5 
equivalents. These observations indicate that 
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Fig. 3. Proposed PFCA degradation mechanism with activation energies 
(AG#, kcal/mol) for each step as calculated at the MO6-2X/6-311+G(2d,p)- 
SMD(DMSO) level. Cycle AD shows a three-carbon shortening of the original 
PFCA of n carbons (‘‘1,” red, top) with one carbon lost as CO2 (converted to 
CO; under basic conditions) and two carbons lost to fluoroacetic acid, which 
readily degrades under these reaction conditions. Pathway B shows the reaction 
that results from the 1,2 addition of hydroxide to the carboxyl carbon of INT6. 


CF;CO, and formate production occur by dis- 
tinct pathways. 

Experiments conducted at near-ambient 
temperatures showed that decarboxylation 
is the rate-limiting step and subsequent de- 
fluorination and chain-shortening steps can 
occur at near-ambient temperature, giving 
experimental insight into the possible mech- 
anism. Substantial defluorination still occurred 
when the isolated PFOA degradation product 
(perfluoro-1H-heptane 2) was subjected to deg- 
radation conditions but heated to only 40°C 
(table S3). PFCAs have historically been decar- 
boxylated by heating PFCA salts in ethylene 
glycol at 190 to 230°C to yield perfluoro-1H- 
alkanes (30) or by pyrolyzing PFCA salts at 
210 to 300°C to yield perfluoro-1-alkenes (3D), 
but dipolar aprotic solvent-assisted degra- 
dation enabled decarboxylation at only 80 
to 120°C, which can be followed by an even 
lower-temperature defluorination. When 2 
was subjected to the basic degradation con- 
ditions, both fluoride and chain-shortened 
PFCAs were observed by IC and °F NMR at 
short reaction times (5 min at 120°C) and low 
temperatures (25 min at 40°C), in contrast to 
reactions starting from the carboxylated PFOA 
at the same conditions, in which no fluoride 
or short-chain PFCAs were formed at short 
reaction times or at low temperatures (table 
83). Degradation of 2 at 40°C for 48 hours 
showed 57% defluorination (table S3). Although 
the insolubility of the polyfluoroalkane stan- 
dard in the DMSO and water solvent precluded 
accurate measurements of its concentration 
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by NMR spectroscopy, the presence of the 
CF3;CO.- 19F NMR peak (fig. $13) indicated that 
the decarboxylated material likely followed 
a similar degradation pathway. In this low- 
temperature experiment, intermediates that 
were not observed in the higher-temperature ex- 
periments became evident; at around -210 ppm, 
a triplet with J = 48 Hz appeared, which cor- 
responds to the fluoroacetate ion (CH,FCOO ; 
fig. S13). The fluoroacetate peak did not appear 
in the higher-temperature degradations be- 
cause it degrades rapidly at those tempera- 
tures, as confirmed by the degradation of a 
pure standard. Temperature-dependent studies 
of the original PFOA degradation reaction 
showed that the reaction slowed slightly when 
the reaction was conducted at 100°C (time to 
[PFOA] = 0 is ~100 hours compared with 16 hours 
for 120°C; figs. S21 and $25 to S27) and slowed 
substantially when lowered to 80°C (>290 hours; 
figs. S21 and S28). Therefore, significant de- 
fluorination of 2 was unexpected at 40°C, 
suggesting that the steps after the decarboxyla- 
tion were low-barrier or barrierless. These ob- 
servations further indicate that degradation 
does not proceed by successive chain shorten- 
ing through iterative decarboxylation steps. 


Computational studies reveal steps 
in defluorination mechanism with 
negligible barriers 


DFT was used to determine the mechanism of 
this degradation reaction. These studies pre- 
dicted that decarboxylation is the rate-limiting 
step of the degradation and that a series of 


Proposed pathways for the conversion of INT14 to INT30, along with pathways 
for nonfluorinated, carbon-containing by-products, are described in fig. S50. The 
alkene INT30 becomes protonated and proceeds through a similar pathway 

as pathway A. At INT35, the aldehyde analog of acid fluoride INT6, 1,2 addition 
to the carboxyl carbon leads to the formation of formate by elimination in 
pathway C, whereas 1,4 addition to the 8 carbon leads back to pathway D. 

All energies are expressed in units of kilocalories per mole. 


low-barrier or enthalpically barrierless reac- 
tions can lead to levels of defluorination con- 
sistent with experimental observations. DFT 
calculations were performed at the M06-2X/ 
6-311+G(2d,p)-"SMD(DMSO) level (see the sup- 
plementary materials for details) and used 
PFOA as the starting point for the calculations. 
This mechanism should also be valid for the 
degradation of straight-chain PFCAs of other 
lengths. After the initial decarboxylation of 
PFOA (compound 1; Fig. 3) at an activation 
energy of about 28 kcal/mol, calculations in- 
dicated that the resulting anion INT1 would 
eliminate a fluoride to become perfluoroalkene 
INT2 (Fig. 3 and fig. $44). Unlike previous 
PFCA degradation mechanisms in the liter- 
ature predicting that the perfluoroalkyl frag- 
ment will hydroxylate after decarboxylation 
(11, 16, 18, 21-23), these computational results 
point to the formation of an alkene followed by 
an enthalpically barrierless hydroxylation of 
the activated electrophilic alkene. Hydroxyla- 
tion of the alkyl fragment INT1, as postulated 
in previous studies, was calculated to have an 
activation energy of 29.7 kcal/mol under our 
study’s conditions after protonation of the 
fragment (fig. S46), whereas formation of the 
alkene INT2 had a barrier of 19.5 kcal/mol, 
followed by a hydroxylation with no enthalpic 
barrier (AG = -44.3 kcal/mol). The highly exo- 
thermic nature of this alkene hydroxylation 
step played a leading role in driving the deg- 
radation, consistent with observations that the 
defluorination and chain-shortening steps of 
the reaction neither have high energy barriers 
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nor lead to the formation of successively shorter 
PFCAs. Accordingly, when perfluoro-1-heptene 
3 (INT2) was subjected to degradation condi- 
tions (table S3), it also degraded to similar 
products even at 40°C, corroborating the com- 
putational prediction and indicating that the 
alkene is likely on the degradation pathway. 
Further, calculations also suggested that the 
hydroxylation is specifically favored at the 
terminal position, because addition on the in- 
ternal side of the alkene had a barrier of 
8.9 kcal/mol (fig. S47). After this alkene hy- 
droxylation (INT4), calculations suggested 
that a series of low- or no-barrier reactions 
occurred, as shown in Fig. 3 and fig. S44. The 
enol can then eliminate another fluoride, form- 
ing o,,8-unsaturated acyl fluoride INT6 through 
retro 1,4-conjugate addition. 

This resulting a,B-unsaturated acid fluoride 
INT6 has two plausible reaction pathways that 
are consistent with the experimental findings: 
a 1,4-conjugate addition that leads to CF;,CO,— 
formation (pathway D) or a 1,2 addition (path- 
way B) that can lead to formate formation 
(pathway C), which together explain the ex- 
perimentally observed by-product distribu- 
tion. Calculations indicated that neither option 
had enthalpic barriers and thus very low free 
energies of activation, indicating that both re- 
actions occurred to some extent (fig. S48). In 
the enthalpically barrierless 1,4-conjugate ad- 
dition (Fig. 3, pathway D, X = F) that leads 
to the formation of shorter PFCAs such as 
CF3CO, , the hydroxide adds to the 8 carbon 
of a,B-unsaturated acyl fluoride INT6, followed 
by an enthalpically barrierless fluoride elimina- 
tion to form 1,3-diketone compound INTS. Hy- 
droxide again adds to this intermediate on 
the ketone carbonyl side to generate INT9, 
which is more favorable than the addition on 
the acyl fluoride side (fig. S49). Finally, frag- 
mentation occurs to generate an equivalent 
of PFCA three carbons shorter than the initial 
carboxylic acid and an equivalent of fluoro- 
acetic acid, which was observed in the exper- 
iments conducted at 40°C (figs. S13 and S16). 
As an example, if five-carbon PFCA perfluor- 
opentanoic acid (PFPeA) went through this 
cycle, it would produce an equivalent of 
carbon dioxide (1 carbon), an equivalent of 
trifluoroacetic acid (2 carbons), and an equiva- 
lent of fluoroacetic acid (2 carbons) by this 
pathway. However, from the experimental 
results, only about 0.3 equiv of CF;CO. were 
produced from PFPeA (Fig. 2D), indicating 
the PFCA degradation does not proceed quan- 
titatively by this process. This pathway also 
does not account for the substantial amounts 
of formate produced in reactions from longer 
PFCAs. 

Formate ion production is explained by a 
pathway stemming from the favorable 1,2- 
hydroxylation product, which provides an 
a,B-unsaturated PFCA (pathway B). As with 
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INT6, there are multiple possible sites for 
hydroxide addition to INT14, either to the a 
(13.6 kcal/mol) or B (12.0 kcal/mol) carbons. 
Possible pathways propagating from both of 
these processes, along with the formation of 
oxalate and other carbon by-products, are de- 
scribed in the supplementary materials (figs. 
S50 to S54). Although both of these pathways 
for the conversion of INT14 to INT30 are 
plausible and supported by computation, the 
possibility of other active mechanisms cannot 
be ruled out. However, both of these hydrox- 
ylations are more favorable than decarboxylating 
the o,f-unsaturated perfluoroacid (22.3 kcal/ 
mol), and both lead to the formation of per- 
fluoroalkene anion INT30. The chain length 
of the alkene depends on which hydroxylation 
pathway the substrate follows, either four car- 
bons shorter than the original chain (1,3 addition) 
or five carbons shorter than the original chain 
(1,4 addition). Calculations showed that per- 
fluoroalkene anion INT30 is protonated rather 
than eliminating a fluoride to generate the 
alkyne (figs. S55 and S56). After the protona- 
tion, hydroxide adds to the alkene, much like 
the first postdecarboxylation step in the first 
proposed pathway. Likewise, o,f-unsaturated 
aldehyde INT35, an analog to the o,,8-unsaturated 
acid fluoride INT6, is generated through 
retro-1,4 addition. At this point, the inter- 
mediate again faces a bifurcation, with oppor- 
tunities for both the 1,4-conjugate addition and 
the 1,2 addition of the hydroxide to the o,f- 
unsaturated aldehyde. Similar to the addition 
to the o,,B-unsaturated acyl fluoride, both of these 
reactions were calculated to have no enthalpic 
barrier (fig. S57). Through the 1,4-conjugate 
addition (Fig. 3, pathway D, X = H; figs. S59 
and S60), the 1,3-diketone compound gener- 
ated will be attacked by hydroxide, followed 
by the same fragmentation as noted before. 
That is, a PFCA and a fluoroacetic aldehyde 
are formed, the latter of which can be trans- 
formed into fluoroacetic acid or be rapidly 
hydrolyzed. However, if INT35 undergoes 1,2 
addition of hydroxide to the o,f-unsaturated 
aldehyde (Fig. 3, pathway C; figs. S55 and S60), 
the resulting aldehyde (INT36) cannot elimi- 
nate a hydride, whereas its acid fluoride coun- 
terpart INT13 can eliminate a fluoride. Instead, 
INT36 can eliminate the entire perfluoroalkyl 
chain, creating an equivalent of formate and a 
one-carbon-shorter alkene anion that can 
either exit the cycle through 1,4-conjugate 
addition or proceed through the cycle again 
to form more formate, thus giving rise to the 
trend of increased formate formation by PFCAs 
of longer chain length. 


Experimental support for the computationally 
determined mechanism 


Our calculations affirmed that decarboxylation 
is the rate-determining step of the degrada- 
tion, and the calculated activation energy of 


~28 kcal/mol is consistent with the experi- 
mentally determined value of 30.0 kcal/mol 
(see the supplementary materials, page 6 and 
table S2). The proposed mechanism is also 
supported by experimental observations of 
CF;CO, and the formate distribution shown in 
Fig. 2D. By this mechanism, CF;CO. was pro- 
duced as a nonstoichiometric by-product, in 
accordance with the observation that only 
~0.3 to 0.4 equivalents of CF;CO, were formed 
per mol PFCA for all PFCAs with C = 5. This 
proposed mechanism also explains why four- 
carbon perfluorobutanoic acid (PFBA) did 
not produce CF;CO, , whereas the five-carbon 
PFPeA did, because PFBA that has gone through 
cycle AD would create FCOO , which will de- 
compose spontaneously to carbon dioxide and 
fluoride (32) or hydrolyze from INTS8 to form 
tartronate. This two-cycle mechanism also 
explains why five-carbon PFPeA produced 
CF;CO, but no formate, because the carbon 
chain is not long enough to go through path- 
way C. The mechanism predicts that the amount 
of formate will increase as the length of the 
initial PFCA carbon chain increases; this was 
also affirmed by experimental results for PFCAs 
of six to nine carbons (Fig. 2D). The formation 
of carbonaceous by-products such as oxalate, 
glycolate, and tartronate is also consistent with 
this mechanism (figs. S50 to S54). Furthermore, 
when conducting reactions with protodecar- 
boxylated perfluoro-1H-heptane 2 or perfluoro- 
1H-hexane S1 (figs. S5 to S8) at 40°C, the 
formation of intermediate products containing 
five- or four-carbon fluorous chains was ob- 
served (figs. $14 and S17), respectively, which 
likely correspond to INT8/INT9 (figs. S15 and 
$18), the intermediate with the highest acti- 
vation energy (25.6 kcal/mol) in this pathway. 
The peaks corresponding to this intermediate 
disappeared as peaks corresponding to the 
five- and four-carbon PFCAs appeared. These 
PFCAs that are shortened by three carbons are 
the logical products of a single-pathway AD 
cycle from their respective starting materials. 
The experimental observations confirm that 
the computed mechanism provides a complete 
model to describe the observations made ex- 
perimentally about this complex degradation. 
We also performed calculations to test pro- 
posed difluorocarbene (fig. S62), perfluoroalkyl 
hydroxylation (fig. S46), and a-lactone (33, 34) 
(fig. S61) mechanisms that had been proposed 
for such degradations, but these were found to 
have barriers too high to be compatible with 
the experimental conditions. 


Generalization of the PFCA destruction 
method to perfluoroalkyl ether carboxylates 


Branched perfluoroalkyl ether carboxylic acids, 
another major class of PFAS contaminants, are 
also mineralized by perfluoroalkyl anion inter- 
mediates. The ammonium salt of hexafluo- 
ropropylene dimer acid (ammonium perfluoro 
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Fig. 4. Proposed mechanism for branched perfluoroalkyl ether carboxylic acid degradation. Pathway 
A (blue) shows the branched CF3 defluorinating in the same manner as PFCAs in Fig. 3. The lack of 
y-fluorines forces formation of 5 through pathway E (orange), as observed by NMR and MS. Calculations 
show the hydroxide-mediated Sy2 that eliminates the perfluoroalkoxide tail in pathway F (purple), leading to 
the formation of a PFCA that is degraded according to the mechanism described in Fig. 3. All energies 


are expressed in units of kilocalories per mole. 


(2-methyl-3-oxahexanoate; also known as FRD- 
902, the trade name GenX, or HFPO-DA in its 
acid form) is a perfluoroalkyl ether carbox- 
ylic acid that was introduced as an industrial 
replacement for PFOA. This compound now 
contaminates water sources such as the Cape 
Fear River, which serves as the primary drink- 
ing water source for >350,000 residents of 
North Carolina (35). For this compound, the 
decarboxylation and branched CF; chain de- 
fluorination occurred at 40°C, an even lower 
temperature than for the PFCAs (fig. S35). This 
finding is consistent with computational re- 
sults indicating that the barrier for GenX de- 
carboxylation is only 20.4 kcal/mol (fig. S63). 
However, because of the presence of the ether 
oxygen in place of the y-carbon, the structure 
was unable to eliminate a y-fluorine and in- 
stead formed perfluoroalkyl ether carboxylic 
acid intermediate 5 through hydrolysis (Fig. 4), 
which built up in solution and was observed by 
both °F NMR and electrospray ionization MS 
(figs. S33, S35, and S42). Further degrada- 
tion occurred at elevated temperatures (80°C; 
fig. S35). Calculations showed that the decar- 
boxylation of this intermediate was unfavora- 
ble (figs. S65); rather, a hydroxide-mediated 
Sy2 with a barrier of 21.9 kcal/mol occurred 
in which the perfluoroalkoxide tail was elim- 
inated (fig. S66). This perfluoroalkoxide formed 
a carboxylic acid (AG* = 21.9 kcal/mol) with 
the same number of carbons as the original 
perfluoroether tail. Because GenX contains a 
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three-carbon tail, it produced the C3 PFCA 
(PFPrA), the degradation of which led to in- 
complete defluorination (41%; Fig. 2D) and 
the formation of CF3;CF,H (figs. $37, S40, and 
S41). These observations are consistent with 
those of the direct degradation of PFPrA (Fig. 
2D and figs. S11, S12, $39, and S40). The ex- 
perimental observations showed that temper- 
atures of 40°, 80°, and 120°C are necessary to 
form intermediate 5, to form the PFCA analog, 
and to initiate PFCA degradation, respectively. 
These temperature steps correspond to the 
calculated energy barriers of 20.4, 21.9, and 
27.7 kcal/mol, respectively (figs. S35, S63, and 
S64). Degradation of a longer perfluoroalkyl 
ether carboxylic acid with a five-carbon per- 
fluoroalky] tail (compound 4; figs. S34, S36, 
S40, and S43) proceeded by a similar mech- 
anism as that of GenX and gave fluoride re- 
coveries consistent with those obtained from 
the five-carbon PFCA PFPeA. These findings 
indicate that perfluoroalkyl ether carboxylates 
also degrade through perfluoroalkyl anion- 
based processes. Intermediates in the degrada- 
tion of 4, as observed by atmospheric pressure 
chemical ionization-MS (fig. S43), corroborated 
the proposed mechanism (Fig. 4). 


Conclusions 


The perfluorocarbon reactivity that we have 
described here leverages low-barrier defluori- 
nation mechanisms to mineralize PFAS at mild 
temperatures with high rates of defluorination 


and low organofluorine side-product forma- 
tion. In contrast to other proposed PFAS deg- 
radation strategies, the conditions described 
here are specific to fluorocarbons, destroy con- 
centrated PFCAs, give high fluoride ion recov- 
ery and low fluorinated by-product formation, 
and operate under relatively mild conditions 
with inexpensive reagents. The proposed mech- 
anism is consistent with both computational 
and experimental results, provides insight into 
the complexity of PFAS mineralization pro- 
cesses, and may be operative but unrecognized 
in other PFAS degradation approaches. This 
demonstration of the reactivity of perfluoro- 
alkyl anions, and the ability to access such 
intermediates efficiently from PFCAs, may 
inform the development of engineered PFAS 
degradation processes and facilitate expand- 
ing this reactivity mode to PFAS with other 
polar head groups. 
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Input-dependent segregation of visual and 
somatosensory circuits in the mouse 


superior colliculus 


Teresa Guillamon-Vivancos, Mar Anibal-Martinez, Lorenzo Puche-Aroca, Juan Antonio Moreno-Bravo, 
Miguel Valdeolmillos, Francisco J. Martini, Guillermina Lopez-Bendito* 


Whereas sensory perception relies on specialized sensory pathways, it is unclear whether these pathways 
originate as modality-specific circuits. We demonstrated that somatosensory and visual circuits are not 

by default segregated but require the earliest retinal activity to do so. In the embryo, somatosensory and visual 
circuits are intermingled in the superior colliculus, leading to cortical multimodal responses to whisker pad 
stimulation. At birth, these circuits segregate, and responses switch to unimodal. Blocking stage | retinal 
waves prolongs the multimodal configuration into postnatal life, with the superior colliculus retaining a mixed 
somato-visual molecular identity and defects arising in the spatial organization of the visual system. Hence, 
the superior colliculus mediates the timely segregation of sensory modalities in an input-dependent manner, 
channeling specific sensory cues to their appropriate sensory pathway. 


n the mature cerebral cortex, sensory mo- 
dalities are segregated into specialized areas 
known as primary sensory cortices. So that 
cortical areas ultimately respond to a spe- 
cific sensory stimulus, this segregation is 
thought to occur during development, and it is 
first instructed by intrinsic factors and later by 
sensory experience (/-5). However, this early 
inception of sensory identity has not been di- 
rectly demonstrated because there is no func- 
tional evidence of the sensory specificity of 
emerging circuits. It remains to be determined 
whether sensory identities arise directly as 
unimodal entities or whether they are initially 
multimodal and become specified over time. 
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We found that the nascent somatosensory 
and visual pathways of mice are functionally 
interconnected, with both cortices respond- 
ing to tactile stimulation at prenatal stages 
yet segregating into independent pathways 
at birth. This segregation happens in the 
superior colliculus (SC) and depends on 
retinal input. Blocking stage I retinal waves 
or removing retinal projections in embryos 
leads to a failure in the timely developmen- 
tal segregation of visual circuits in the SC. 
Consequently, the activation of the visual 
cortex by somatosensory stimuli extends into 
postnatal life and is associated with long-term 
circuit abnormalities. 


S1 and V1 emerge functionally intermingled 


A functional map of the periphery is present in 
the mouse somatosensory cortex at perinatal 
stages, as shown through whisker pad stimu- 


lation in a transgenic mouse line in which 
glutamatergic neurons in the neocortex ex- 
press GCaMPéf (hereafter Cx9°““?*) (6). 
Hence, there appears to be a high level of 
intramodal functional organization of sen- 
sory circuits at these early stages. We used 
the same approach to test whether devel- 
oping sensory circuits also show early speci- 
ficity to stimulus modalities. We found that 
at embryonic stages, a somatosensory stimu- 
lus not only triggered the expected contra- 
lateral response in the barrel field of the 
primary somatosensory cortex (S1) but also a 
bilateral response in the presumptive primary 
visual cortex (V1). This multimodal response 
was observed in 33% of the mice analyzed at 
embryonic day 18 (E18) but disappeared by 
postnatal day 0 (PO) (Fig. 1, A to C, and movies 
S1 and S2), suggesting that we were revealing 
the end of a developmental process. We next 
tested whether the disappearance of this cor- 
tical multimodal response might be related to 
the arrival of retinal inputs to central struc- 
tures, which occurs close to birth (7-9). To 
test this possibility, we used Cr?" mice 
in which their eyes were eliminated bilaterally 
at E14 (embBE mice) (10). Whereas whisker 
pad stimulation at PO triggered only the cor- 
tical somatosensory response in control mice, 
in the embBE mice, this response was consist- 
ently multimodal, activating both S1 and V1 in 
100% of the cases, even at E18 (Fig. 1, D to F, 
and movies S3 and S4). This multimodal re- 
sponse extended into the first postnatal days 
and lasted until P6 (Fig. 1G and fig. S1). Thus, it 
appears that somatosensory and visual circuits 
are not segregated by default but require the 
arrival of retinal input to do so. 

We then designed experiments to identify 
the circuits responsible for this lack of segrega- 
tion and the ensuing multimodal response in 
embBE mice. Electrical stimulation of either S1 
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Fig. 1. Embryonic somatosensory and visual cortices originate as multimodal. 
(A) (Left) Experimental design. (Right) Calcium responses in the cortex (pink) elicited 
by mechanical stimulation of the whisker pad in control mice at E18 and PO. 

(B) Proportion of the whisker pad stimulations that evoked a contralateral or ipsilateral 
response in S1 and V1 of control mice at E18, and in V1 from E18 to P6 (E18, n = 9 
mice; PO, n = 13 mice; P3 to P6, n = 8 mice). (C) Scheme summarizing the results. 
(D) (Left) Experimental design. (Right) Calcium responses in the cortex (pink) elicited by 


or V1 in the embBE mice failed to reproduce the 
bilateral V1 responses to whisker pad stimula- 
tion (fig. S2, A to D). Moreover, these responses 
were still triggered by whisker pad stimulation 
when the barrel field was inhibited by tetro- 
dotoxin (TTX) (fig. S2, E and F). Hence, these 
experiments demonstrate that the sensory- 
modality interconnection in embBE mice is 
not driven by a cortico-cortical mechanism. 


SC circuit reorganization provides 
sensory-modality specificity 

Before reaching the cortex, sensory-modality 
pathways converge in nearby territories of the 
thalamus and also in the SC (77). Therefore, 
the multimodal responses observed might in- 
volve direct communication between modal- 
ities at either of these structures. The SC is a 
midbrain multisensory structure that receives 
visual input in its superficial layers (SSC) and 
somatosensory input in its deep layers (dSC) 
(11, 12). Using mice that express GCaMPé6f in 
SC neurons (hereafter SCC@”?*), we found 
that whisker pad stimulations at PO evoked 
a bilateral response in the sSC of all embBE 
mice, whereas no responses were evoked in 
the sSC of control mice (Fig. 2, A to C, and 
movies S5 and S6). To confirm that the cor- 
tical multimodal response requires the SC, 
we inactivated the SC with TTX in embBE tri- 
ple transgenic mice that expressed GCaMP6f 
in the cortex and SC (Cr-SC°@”"") and found 
that the cortical response became unimodal 
(Fig. 2, D to F). Similar results were found 
when the SC was acutely lesioned (fig. S3, A 
and B). Last, we looked for sensory modality 
changes in the circuits downstream of the SC 
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and, using dye tracing, demonstrated that 
the connectivity from the SC to the thalamus 
was normal in embBE mice (fig. S3, C and D). 
Similarly, we found no differences in the spe- 
cificity of the sensory modality of thalamic 
afferents or efferents in these embBE mice 
(fig. S4). Therefore, these results demonstrate 
that the SC mediates the multimodal cortical 
responses in embBE mice and suggest that 
the SC is involved in the developmental seg- 
regation of sensory modalities observed in 
control mice. 

We next assessed how the SC channels so- 
matosensory information into the presump- 
tive V1 by testing the connectivity between 
the dSC (somatosensory) and sSC (visual) layers 
in embBE mice. At PO, whereas unilateral 
electrical stimulation of the dSC layers in 
SCO™PF control slices only elicited responses 
in the contralateral dSC layer, the responses in 
the embBE mice also propagated to both the 
ipsilateral and contralateral sSC layers (Fig. 2, 
Gand H, and movies S7 and S8), an effect that 
was not observed by stimulating the super- 
ficial layers (fig. S5). In addition, these func- 
tional results were supported by anatomical 
tracing. Unilateral injection of lentivirus- 
expressing enhanced green fluorescent pro- 
tein (EGFP) into the SC produced widespread 
labeling of the contralateral dSC and sSC 
layers in embBE mice, whereas only the con- 
tralateral dSC layer was labeled in their con- 
trol littermates (Fig. 2, I and J). Conversely, 
when the multimodal response finally switched 
to unimodal in the embBE mice at P6, we 
observed a significant reduction, as compared 
with PO, both in the number of axons in the 


mechanical stimulation of the whisker pad in embBE mice at E18 and PO. (E) Proportion 
of the whisker pad stimulations that evoked a contralateral or ipsilateral response in 
Sl and V1 of control and embBE mice at PO (n = 19 control mice, n = 21 embBE mice). 
(F) Scheme summarizing the results. (G) Proportion of the whisker pad stimulations 
that evoked a bilateral V1 response in embBE from E18 to P6 (E18, n = 5 mice; PO, n = 21 


P6, n = 8 mice). Scale bars, 1000 um. Bar graphs show 


the means + SEM. ns, not significant; *P < 0.05, **P < 0.01, ***P < 0.001. 


sSC and in the bilateral V1 or SC activation 
elicited by whisker pad stimulation (fig. S6). 

The data from the embBE suggest that the 
multimodal embryonic response in control 
mice could also be mediated by the SC. Using 
the Cxr-SCC"”"!, we found that whisker pad 
stimulation at E18 in control mice elicited a 
bilateral V1 response in 37% of the cases, 
which was always accompanied by concomi- 
tant bilateral sSC activation. However, acti- 
vation of the sSC was not detected when the 
cortical multimodal response switched to uni- 
modal at PO (Fig. 3, A to C). We then tested 
whether the switch from multimodal to uni- 
modal cortical responses involved a develop- 
mental reconfiguration of the intracollicular 
circuits. As such, we injected a lentivirus- 
expressing EGFP unilaterally into the SC at 
E14 and analyzed the disposition of the axons 
in the contralateral sSC at E18 and PO. We de- 
tected significantly more labeled axons in the 
sSC layer at E18 than at PO (Fig. 3D). This un- 
plugging of the sSC layer from the dSC layer, 
which was delayed in embBE mice, coincides 
with the peak innervation of retinocollicular 
axons (Fig. 3E) (7). These results suggest that 
the arrival of retinal axons to the SC prompts 
the segregation of somatosensory and visual 
circuits. 


Perinatal retinal waves drive the SC 
circuit reorganization 


Developing retinal axons convey a stereotyped 
pattern of spontaneous activity known as reti- 
nal waves. Stage I retinal waves, which are 
mediated by gap junctions, last approximately 
from E17 until Pl in mice (Fig. 3F) (3), and 
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Fig. 2. Somatosensory and visual SC circuits remain intermingled in 
embBE mice. (A) (Left) Experimental design. (Right) Calcium responses (pink) 
in the SC elicited by mechanical stimulation of the whisker pad in control and 
embBE mice at PO. (B) (Left) Quantification of the data shown in (A) (n = 7 
control mice, n = 7 embBE mice). (€) Scheme summarizing the data. (D) Cortical 
and SC calcium responses elicited by mechanical stimulation of the whisker 
pad in embBE mice at PO before, during, and 1 hour after bilateral TTX injection 
in the SC (asterisks). (E) Quantification of the data shown in (D) (n = 6 mice). 
(F) Scheme summarizing the data. (G) (Left) Experimental design. (Right) 
Calcium responses elicited by electrical stimulation of the dSC in control and 
embBE slices at PO. There is spreading to the ipsilateral and contralateral 
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superficial layers of the SC in embBE mice (arrowheads). (H) Proportion of the 
total area (active fraction) of the sSC and dSC layers activated by a perithreshold 
stimulus in the dSC at PO (n = 8 control mice, n = 5 embBE mice). (I) (Left) 
Experimental design. The lentivirus was injected into the SC at E14. (Right) 
Optical coronal sections from three-dimensional (3D) light-sheet images showing 
the EGFP lentiviral-labeled axons in the sSC of embBE mice at PO (arrowhead). 
(J) Quantification of the normalized EGFP expression in contralateral sSC and 
dSC (n = 8 control mice, n = 4 embBE mice). Scale bars, (A) and (D) 1000 um; 
(G) and (I) 150 um. Boxplots show the medians with the interquartile range 
(box) and range (whiskers). The bar graphs show the means + SEM. ns, not 
significant; *P < 0.05, **P < 0.01, ***P < 0.001. 


therefore, we tested whether these waves might 
play a role in directing intracollicular sensory- 
modality segregation. We blocked stage I reti- 
nal waves by injecting the gap junction blocker 
carbenoxolone (cbx) (13, 14) into both eyes of 
CxPCMPF mice at PO. To confirm that cbx 
blocks gap junction-mediated retinal waves 
in vivo, we recorded spontaneous activity in 
V1, the frequency of which was reduced sig- 
nificantly 1 hour after the administration of 
cbx at PO (fig. S7, A and B). This reduction was 
not due to the degeneration of retinofugal 
axons because these axons reached both the 
thalamus and the SC (fig. S7C), and visual 
spontaneous activity was recovered at P6 (fig. 
87, A and B). Accordingly, acute administra- 
tion of epibatidine, a high-affinity cholinergic 
agonist that inhibits postnatal stage II retinal 
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waves (15-17), significantly reduced the fre- 
quency of spontaneous activity in V1 at P6, 
whereas cbx had no such effect at this stage 
(fig. S7, D and E). Altogether, these experiments 
show that most of V1 activity at the perina- 
tal stage in vivo is driven by gap junction- 
mediated retinal waves. 

We next recorded the cortical activity after 
whisker pad stimulation in Cr°©“”” mice 
treated with cbx at PO (Fig. 3G). Seven to 
8 hours after cbx administration, stimulation 
of the whisker pad triggered in all mice a 
multimodal response that was elicited in 44% 
of the trials and that rose to 64% after a second 
injection of cbx on the following day (Fig. 3, G 
to I, and movies S9 and S10). By contrast, this 
multimodal response was not triggered by the 
injection of saline or epibatidine (Fig. 3, G to 


I). Cbx administration to Cx-SCOM”"” mice at 
PO showed concomitant bilateral responses in 
the V1 and SC at P1 (fig. S8, A to C, and movie 
S11), which were abolished by TTX acute injec- 
tion into the SC (fig. S8, D and E). As in the 
embBE, lentiviral tracing in the SC of cbx- 
treated mice showed the abnormal invasion 
of axons from the dSC layer into the sSC layer 
at P1 (Fig. 3J). Last, we checked whether the 
responses to whisker pad stimulation eventu- 
ally switch to unimodal in the cbx-treated mice. 
When we performed the whisker pad stimula- 
tion in the C10" and triple Cr-SCOUr es 
mice, we found that the multimodal responses 
both in the V1 and SC were almost absent by 
P6 (fig. S8, F to I). These data show that the 
timely segregation of somatosensory and visual 
pathways require a perinatal reorganization 
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Fig. 3. Blocking stage | retinal waves prolongs the multimodal configuration. 
(A) Whisker pad stimulation leads to the concomitant bilateral activation of both V1 
and SC in control E18 mice, which switches to unimodal at PO. (B) Quantification 

of the data shown in (A) (n = 8 E18, n = 13 PO). (€) Scheme summarizing the data. 
(D) (Left) Optical coronal sections from 3D light-sheet images showing EGFP 
lentiviral-labeled axons in the sSC of control E18 (arrowheads) but not PO mice. The 
lentivirus was injected into the SC at E14. (Right) Quantification of the normalized 
EGFP expression in contralateral sSC and dSC (n = 9 E18 mice, n = 8 PO mice). 

(E) (Left) Coronal views of retinal axons labeled by CTB injection of the eye at E14 that 
reach the SC at E18 and PO. (Right) Quantification of the data shown at left (n = 9 
E18 mice, n = 6 PO mice). (F) Scheme showing early phases of retinal waves. (G) (Left) 
Experimental design. (Right) Cortical calcium responses at P1 elicited by mechanical 
stimulation of the whisker pad after bilateral injection of (left) saline, (middle) cbx, 
and (right) epibatidine (epb) into the eye at PO. (H) Proportion of the whisker pad 


wps “contra contra 
stimulations that evoked a bilateral V1 response before and after the injection of saline, 
cbx, or epb: time (t) = 0, n = 20 mice injected with saline, n = 24 cbx, n = 17 epb; 

t =1 to 2 hours, n = 4 saline, n = 8 cbx, n = 6 epb; t = 4 to 5 hours, n = 6 saline, n = 
7 cbx, n= 9 epb; t = 7 to 8 hours, n = 7 saline, n = 4 cbx, n = 3 epb; t = 24 hours, n = 19 
saline, n = 14 cbx, n = 13 epb; and t = 32 hours, n = 11 saline, n = 12 cbx, n = 7 epb. 
(I) Scheme summarizing the data. (J) (Left) Optical coronal sections from 3D light- 
sheet images showing lentiviral EGFP-labeled axons in the sSC of P1 mice treated 
at PO with cbx (arrowheads) but not with saline. The lentivirus was injected into the SC 
at E14. (Right) Quantification of the normalized EGFP expression in contralateral 

sSC and dSC (n = 5 saline, n = 6 cbx). Scale bars, (A) and (G) 1000 ym; (D) 200 yum; 
(E) 250 pm; (J) 200 um. Boxplots show the medians with the interquartile range 
(box) and range (whiskers). The bold lines in (H) indicate the mean, and the shading 
indicates the SEM. The bar graphs in (B) show the mean + SEM. ns, not significant; 
*P < 0.05, **P < 0.01,***P < 0.001. 


of intracollicular circuits that depends on gap 
junction-mediated retinal waves. 


Failure to timely segregate drives long-lasting 
circuit alterations 


Although reversed by the end of the first 
postnatal week, the transient expansion of the 
multimodal phase may have long-term effects 
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on the specification of sensory circuits. For ex- 
ample, perinatal blockage of retinal activity 
might cause enduring changes in the tran- 
scriptional program of SC layers. To assess 
this, we performed bulk RNA-sequencing 
(RNA-seq) of the sSC and dSC layers at P6 in 
mice treated with saline or cbx at PO to Pl 
(Fig. 4A). A principal components analysis 


(PCA) revealed that sSC and dSC cells clus- 
tered according to their anatomic origins (fig. 
S9A). Moreover, a differential expression anal- 
ysis (DEA) in saline conditions revealed 1544 
differentially expressed genes (DEGs) enriched 
in the sSC as opposed to 1549 in the dSC layers 
(Fig. 4A and table S1). Among the DEGs en- 
riched in each population, we found genes 
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Fig. 4. Transcriptomic SC alterations following blockade of perinatal retinal 
waves. (A) (Left) Scheme of the RNA-seq experiments of sSC and dSC tissue from 
P6 mice treated with saline and cbx at PO-P1. (Right) Heatmap of normalized, 
regularized logarithm (Rlog) z-score of expression, and unbiased clustering of 
significant DEGs between the sSC and dSC in saline-treated mice. Each row 
represents a gene, and the columns are biological replicates. The color code 


indicates the normalized expression for up-regulated genes in yellow versus down- 


regulated genes in purple. Highlighted genes are those previously identified as 
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layer-specific markers. (B) Volcano plot showing the significance and P value 
distribution of DEGs. The light brown and dark brown dots indicate sSC and 
dSC DEGs, respectively, and the green dots and the percentages indicate layer- 
specific DEGs shifted by cbx, with the top 10 protein coding genes listed in 
every region. (©) Heatmaps of the normalized regularized logarithm (Rlog) 
z-score of expression and unbiased clustering of layer-specific DEGs whose 
expression was modified by cbx in the sSC and dSC regions. (D) Scheme 
summarizing the data. 
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Fig. 5. Long-term alterations in eye-specific segregation and retinotopy 
after blockade of perinatal retinal waves. (A) (Left) Experimental design. 
(Right) Coronal view of the SC showing axons from the ipsilateral (CTB-555) 
and contralateral (CTB-647) eye at P15 in saline- and cbx-treated mice. 

(B) Quantification of the data in (A) (contralateral, n = 10 saline, n = 6 cbx; 
ipsilateral, n = 7 saline, n = 5 cbx). (C) (Left) Experimental design. (Right) 
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Coronal view of the dLG showing back label from Dil and DiD crystals in V1 
at P15 in saline- and cbx-treated mice. (D) Quantification of the data shown 
in (C) (azimuth, n = 7 saline, n = 5 cbx; elevation, n = 6 saline, n = 5 cbx). 
Scale bars, 100 um. The bold lines in (B) indicate the mean, and the shading 
indicates the SEM. Bar graphs show the mean + SEM. ns, not significant; 

*P < 0.05, **P < 0.01. 
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previously identified as layer-specific markers, 
including those encoding transcription factors 
such as Rorb and Barhll in the sSC layer or 
Pou4fl and Pou4f2 in the dSC layer (78, 19). 
To determine whether cbx may influence SC 
layer-specific genes, we compared the RNA- 
seq data between mice that received saline or 
cbx and found that cbx significantly modified 
24% (370 genes) of the sSC-specific genes and 
21.1% (326 genes) of the dSC-specific genes 
(Fig. 4B, fig. SOB, and table S2). Among these, 
cbx induced the expression of a large propor- 
tion of the dSC-specific genes (285 out of 326 
genes) in the sSC layer, and that of sSC-specific 
genes (168 out of 370 genes) in the dSC layer. 
Likewise, it provoked the down-regulation of 
layer-specific gene expression to levels resem- 
bling those in the other layer (Fig. 4C and fig. 
S9, C and D). Together, these results demon- 
strate that the somatosensory and visual mo- 
lecular identity of SC layers was altered by the 
blockade of stage I retinal waves (Fig. 4D). 

The alterations to the modality-specific iden- 
tity of sSC and dSC may somehow affect the 
stereotypic organization of incoming inputs to 
these layers. Within the visual system, retino- 
collicular axons are organized in eye-specific 
segregated clusters in the sSC (7, 20), and there- 
fore, we tested whether eye-specific segregation 
is disrupted in cbx-treated mice. Cholera toxin 
subunit B (CTB) injection into the eyes of peri- 
natally cbx-treated mice (fig. SIOA) showed a 
compromised eye-specific segregation in the 
sSC at P15. Whereas in the saline condition, 
contralateral and ipsilateral axons segregated 
into dorsal and ventral sSC compartments, 
respectively, the ipsilateral axons in cbx-treated 
mice invaded the dorsal compartment and 
mingled with the contralateral axons (Fig. 5, 
A and B, and movies S12 and S13). In addi- 
tion, small dye crystals placed in V1 at the 
azimuth and elevation axis at this stage also 
showed alterations in the fine-scale organi- 
zation of the geniculo-cortical pathway (Fig. 5, 
C and D, and fig. S10, B to D). Together, these 
results demonstrate that early perturbations 
in retinal activity leads to long-lasting circuit 
reconfiguration at central stations within the 
visual pathway. 


Discussion 


Our findings demonstrate that visual and soma- 
tosensory pathways emerge as multimodal 
circuits, and that during perinatal life, they seg- 
regate in a manner that is orchestrated in the 
SC. The multimodal-to-unimodal transition re- 
quires a reconfiguration of intracollicular cir- 
cuits at birth, in which dSC layers disconnect 
from the sSC layers. This reconfiguration drives 
the specificity of sensory circuits, so that 
whisker pad stimulation exclusively triggers S1 
responses in the cortex after birth. The switch 
from multimodal to unimodal requires early 
retinal activity because perturbation of stage I 
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retinal waves prevents the reconfiguration of 
the circuits in the SC and leads to the exten- 
sion of the multimodal phase into postnatal 
life. Therefore, these data broaden our under- 
standing of retinal wave function by revealing 
their instructive role in the acquisition of sen- 
sory modality specificity, which is beyond their 
classic role in the postnatal refinement of vis- 
ual circuits (21-25). 

The segregation of visual from somato- 
sensory systems must occur in a limited de- 
velopmental window because any delay in this 
segregation will cause long-lasting changes in 
the intramodal organization of visual circuits. 
The mechanisms that mediate the final clos- 
ing of the multimodal phase in embBE and 
cbx-treated mice remain unexplored, although 
they may include the influence of passive 
whisking during the first days of life (26) or 
the progressive assembly of inhibitory cells 
into the networks of the SC, as described for 
cortical networks (27). A longitudinal analysis 
of both the molecular identity of collicular 
neurons and the maturation of their con- 
nectivity in physiological and in manipulated 
scenarios will help identify the cell types and 
potential factors involved in this circuit seg- 
regation and plasticity. 

Last, our findings reveal that the SC, a 
highly conserved structure in vertebrates 
(28, 29), participates in the construction of 
brain regions that appeared more recently in 
phylogenetic terms such as the neocortex, 
a role that goes far beyond the SC’s well- 
established sensorimotor and multimodal in- 
tegrative functions (30). From the phylogenetic 
perspective, it has been stated that the visual 
cortex has inherited an increasing number of 
functions from the SC related to the process- 
ing of visual features (37-33). Our results spot- 
light the ontogenetic perspective, in which the 
developing SC exerts a master control on cor- 
tical specification and configuration of visual 
circuits. Thus, we believe that a deeper under- 
standing of the functional development of 
phylogenetically ancient structures is crucial 
to understand how the neocortex is formed 
and its functional areas are specified. 
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Soybean photosynthesis and crop yield are improved 
by accelerating recovery from photoprotection 


Amanda P. De Souza’, Steven J. Burgess’, Lynn Doran’, Jeffrey Hansen’, Lusya Manukyan’, 
Nina Maryn?, Dhananjay Gotarkar”, Lauriebeth Leonelli*+, Krishna K. Niyogi*°, Stephen P. Long'®* 


Crop leaves in full sunlight dissipate damaging excess absorbed light energy as heat. This protective dissipation 
continues after the leaf transitions to shade, reducing crop photosynthesis. A bioengineered acceleration of 
this adjustment increased photosynthetic efficiency and biomass in tobacco in the field. But could that also 
translate to increased yield in a food crop? Here we bioengineered the same change into soybean. In replicated 


field trials, photosynthetic efficiency in fluctuating light was higher and seed yield in five independent 
transformation events increased by up to 33%. Despite increased seed quantity, seed protein and oil content 
were unaltered. This validates increasing photosynthetic efficiency as a much needed strategy toward 
sustainably increasing crop yield in support of future global food security. 


he number of people affected by food in- 
sufficiency continues to grow (1). Projec- 
tions clearly show that food supply is not 
increasing fast enough to guarantee food 
security. As a result, it is expected that 
more than 840 million people worldwide will 
be undernourished by 2030. Providing seed 
that can achieve higher yield per unit land 
area is an effective way to reduce this food 
shortage, provide food security to the poorest, 
and avoid yet more land being brought into 
production. Improving photosynthesis has been 
suggested as a major opportunity to gain the 
needed jump in yield potential (2). 
Plants dissipate potentially damaging excess 
absorbed light energy in full sunlight by in- 
ducing a mechanism termed nonphotochem- 


Fig. 1. Protein expression, yield, and seed 
composition in engineered soybeans. (A) Repre- 
sentative immunoblots comparing the protein abun- 
dance of wild-type (WT) and eight independent 
transgenic soybean lines expressing AtVDE, AtPsbS, 
and AtZEP (VPZ) grown in the field at Urbana, 

IL, USA. Band corresponding to AtPsbS is the 

lower band; 30 yg of total soluble protein was loaded 
per well; Coomassie Blue-stained gel (CB) is 
presented to show uniformity of loading. Immuno- 
blots of all biological replicates are shown in figs. S1 
and S2 for 2020, and in figs. S3 to S9 for 2021. 
(B) Seed yield in tonnes per hectare (t. ha‘) 
obtained at final harvest in 2020. (C) Protein, oil, 
fiber, and ash content in percentage of dry mass of 
seeds in 2020. Given the high similarity between 
biological replicates, statistical differences were 
obtained, even though most were <1% different from 
WT. These statistical differences are for protein 
between WT and ND-18-56, YZ-19-8A, and ND-18-44; 
for oil between WT and ND-18-56, YZ-26-1C, and 
ND-17-20; for fiber between WT and ND-18-56 and 
ND-17-20; and for ash, between WT and ND-18-56. 
Given the small magnitude of difference, these 

are very unlikely to have any physiological meaning. 


ical quenching (NPQ) (3). This process is 
essential in avoiding the formation of reactive 
oxygen species that would damage the photo- 
synthetic apparatus (4). However, NPQ mech- 
anisms are slow to relax following the frequent 
sun-shade transitions that occur within crop 
canopies. This results in a substantial loss of 
photochemical energy (i.e., between 7.5 and 
30%) that could otherwise be used for photo- 
synthesis (5, 6). For soybean crop canopies, 
this slow NPQ relaxation upon sun-shade tran- 
sitions was calculated to cost >11% of daily 
carbon assimilation (7). 

NPQ combines several different mecha- 
nisms that vary in relaxation kinetics during 
sun-shade transitions (8, 9). Energy-dependent 
quenching (qE) is the major and most rapidly 
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induced component, initiated within seconds 
up to a few minutes upon transfer of leaves to 
high light (3, 10). The induction of qE during 
exposure to high light requires acidification of 
the thylakoid lumen to activate violaxanthin 
de-epoxidase (VDE), which converts viola- 
xanthin to zeaxanthin via the intermediate 
antheraxanthin in the VAZ xanthophyll cycle 
(11). When the leaf transitions back to low 
light, the reverse reaction converts zeaxanthin 
back to violaxanthin on a much slower time 
scale via the activity of zeaxanthin epoxidase 
(ZEP) (12). Photosystem II (PSII) subunit S 
(PsbS) is also implicated in qE modulation of 
NPQ (73), causing PSI]-associated antennae to 
undergo a conformational change. Zeaxanthin 
accumulation is also responsible for a more 
slowly induced and relaxing NPQ compo- 
nent that is independent from both PsbS and 
lumen acidification. This component, termed 
zeaxanthin-dependent quenching (qZ), is where 
zeaxanthin binds to light-harvesting complex 
proteins and is induced during the first ~ 8 to 
10 min of high light (3). 
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Fig. 2. Relaxation of NPQ on high- to 
low-light transitions. (A) Relaxation of NPQ 
in the last of five cycles of 4 min under high 
light (2000 umol m~ s7 PFD) followed by 

3 min under low light (200 pmol ms PFD) 
in wild-type (WT) and independent transgenic 
soybean lines expressing AtVDE, AtPsbS, 
and AtZEP (VPZ) grown in the field at Urbana, 
IL, USA in 2020. NPQ throughout the 0.4 
five cycles is shown in fig. S12. Values were 
normalized to NPQ in high light to allow 
easier comparison among lines; points shown 
are means (+1 SE) for each time point 

(n = 8 biological replicates). (B) Mean (+1 SE) 
of the time constants (t) of the fast (t,) 

and (C) medium (t2) relaxing components of 
NPQ in field-grown plants in 2021. Time 
constants were obtained by fitting a double 
exponential curve (8) to the time course of 
NPQ relaxation (n = 16 biological replicates). 
Semi-log plot of the NPQ values used for 
these calculations is shown in fig. S13. 
Asterisks indicate significant difference (**P < 
0.05) between WT and transgenic lines. 


Up-regulation of VDE, PsbS, and ZEP (VPZ) 
in tobacco significantly accelerated the viola- 
xanthin xanthophyll cycle (VAZ), leading to 
faster induction and relaxation of NPQ. This 
increased the efficiency of CO, assimilation 
and electron transport through PSII in fluc- 
tuating light conditions, without altering 
photosynthetic efficiency at steady state. In 
replicated field trials, shoot biomass produc- 
tion was increased 14 to 21% across three in- 
dependent transgenic events (/4). However, a 
study in transgenic Arabidopsis thaliana (At) 
containing the VPZ expression cassette failed 
to show increased biomass accumulation. This 
observed discrepancy was suggested to be due 
to negative effects of faster relaxation on pho- 
toprotection, or the result of feedback con- 
trols related to lack of sink strength (75). 
Others have claimed that increases in biomass 
in VPZ tobacco can be due to reasons other 
than photosynthesis such as changes in hor- 
mone levels (J6) or suggested that sink limi- 
tations will prevent any realization of increased 
seed yield in food crops (77-19). To test whether 
a yield increase could be achieved in a major 
agronomical crop, we introduced the VPZ con- 
struct into soybean [Glycine max (Gm)]. In terms 
of total global production, soybean is the fourth 
most important grain crop and the most im- 
portant single source of vegetable protein. 


Results 
Engineered acceleration of NPQ relaxation 
increases soybean yield 


During the summers of 2020 and 2021, field 
experiments were performed in Urbana, IL, 
USA to test the impact of the VPZ construct 
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on photosynthesis, growth, and seed yield in 
independent transgenic lines of T, and T; 
homozygote progeny of soybean (cv. Maverick). 
Maverick is an elite soybean genotype that is 
resistant to stress conditions and has been 
used as a proof of concept of different trans- 
genes (20-22) because of its amenability to 
genetic transformation. 

AtVDE, AtPsbS, and AtZEP protein and 
mRNA were detected in all transgenic lines 
in both years and at different stages of plant 
development (Fig. 1A, figs. S1 to S9, and table 
S1). The PsbS antibody detected both native 
GmPsbS and the transgenic A¢PsbS proteins. 
The AtPsbS protein was observed as a lower 
band just below 23 kDa on a Western blot, 
which was absent in wild-type (WT) samples 
(Fig. 1A and figs. S1 to S9). Transgene ex- 
pression was variable with the lowest abun- 
dance observed in YZ-26-1C and ND-18-44 
lines in both protein and RNA levels (Fig. 1A 
and table S1). As might be expected, the ratio 
of increases in each of the three proteins ap- 
pears more important than the total increase 
(fig. S10). For instance, YZ-26-1C had low over- 
all increases and showed a 21.7% yield in- 
crease, whereas ND-17-20 had high amounts of 
protein without a significant increase in yield 
(Fig. 1, A and B). This is consistent with the 
mathematical simulations that show that the 
ratio of expression of VDE, ZEP, and PsbS is 
more important in achieving altered NPQ 
kinetics than is absolute abundance (fig.S10). 

In 2020, the overexpression of VPZ trans- 
genes led to a significantly higher seed yield 
(Fig. 1B). Of eight independent transgenic 
lines, five showed a significant increase in seed 


yield, and none showed a lower yield. The yield 
increase averaged across these five transgenic 
lines was 24.5% compared to the WT with the 
largest difference being observed for the line 
ND-18-34A (+33%). Higher yields were due 
to a higher seed number per plant (fig. S11A). 
Except for the line YZ-19-21 that showed greater 
seed mass, seeds from transgenic lines were 
overall smaller than WT (fig. S11B). The same 
trend was observed during 2021 on number 
and size of seeds (fig. S12, A and B), although 
those differences did not translate into a sig- 
nificantly higher seed yield at the end of the 
2021 season (fig. S12C). Increased seed pro- 
duction in 2020 was achieved with less than 
1.3% change in protein and oil content, and 
most other components of seed quality (Fig. 1C 
and table S2). Similar results were obtained 
for seeds produced in 2021 (tables S2 and S3). 

Did the overexpression of AfVDE, AtPsbS, 
and A?tZEP (i.e., VPZ genes) lead to a faster 
NPQ relaxation and higher photosynthetic ef- 
ficiency in soybean? As observed in tobacco 
(14), the overexpression of these three trans- 
genes in soybean accelerated the relaxation of 
NPQ during sun-to-shade transitions (Fig. 2), 
resulting in increased photosynthetic efficiency 
under fluctuating light (Fig. 3). The difference 
in NPQ relaxation was assessed by comparing 
the normalized NPQ values at different time 
points or the time constant of NPQ compo- 
nents after the transition from high to low 
light. In both years, NPQ relaxation was faster 
in all transgenic lines (Fig. 2). Between 90 and 
300 s after the sun-shade transition, reduction 
in NPQ across transgenic lines during 2020 
was 11 to 23% greater than in WT (Fig. 2A). 
Similarly, in 2021, the fast (t,) and medium 
(t2) relaxing time constants for NPQ relaxa- 
tion were 37 to 67% smaller in the transgenic 
lines than in WT; i.e., relaxed more rapidly 
(Fig. 2, B and C, and fig. S13). 

Although all transgenic lines showed faster 
NPQ relaxation rates, the amplitude of NPQ 
varied between lines. For instance, ND-18-34A 
and YZ-19-21 had significantly higher NPQ 
under high light, whereas YZ-26-1C had NPQ 
values similar to those of the WT under the 
same conditions (fig. S14). The increased NPQ 
levels under high light did not confer any ben- 
efit to the plants, because the photoprotec- 
tion index (23) did not differ throughout the 
day between any of the VPZ lines and WT 
(fig. S15). 


Effects of the engineered change beyond yield 


The overexpression of VPZ genes in soybean 
led to an overall reduction in levels of vio- 
laxanthin in ND-18-34A and YZ-19-21 with a 
concomitant increase in zeaxanthin in the 
periods of high light exposure (i.e., 2 p.m.) 
(figs. S16, A and C, and S17). Levels of anthe- 
raxanthin in all transgenic lines did not 
change relative to WT (fig. S16B). Differences 
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Fig. 3. Photosynthetic efficiency in steady-state and fluctuating light. (A and D) Quantum efficiency of 
COz assimilation (®COz) under steady-state light. (B and E) Quantum efficiency of linear electron transport 
(®PSIl) under steady-state light. (© and F) Average NPQ corresponding to A - B and D - E, respectively. 
Panels (G to L) are, respectively, the equivalent measures under fluctuating light. Data are for wild-type (WT) 
and soybean lines expressing AtVDE, AtPsbS, and AtZEP (VPZ) grown in the field at Urbana, IL, USA during 
2020 and 2021. Bars are the mean (+ 1 SE) for measurements made during vegetative and seed-filling stages 


(n = 16 biological replicates). Asterisks indicate signifi 


cant differences between WT and VPZ transgenic 


lines (**P < 0.05; *P < 0.1). The separate values for each developmental stage are listed in table S4. 


in zeaxanthin accumulation altered the de- 
epoxidation state (DES) in the VPZ lines. 
Whereas YZ-26-1C had a lower DES through- 
out the day, ND-18-34A and YZ-19-21 showed 
an increase in DES at mid-day in relation to 
the WT, but a faster decrease in DES in the late 
afternoon (fig. SI6D). In both soybean and 
tobacco, VPZ expression was associated with 
faster epoxidation of zeaxanthin, faster t and 
Tz recovery, and less residual NPQ after tran- 
sition from excess light to limiting light. Ap- 
parently, unlike tobacco, soybean has a second 
xanthophyll cycle, the lutein epoxide cycle 
(LxL cycle), which converts lutein epoxide into 
lutein during exposure to high light. The LxL 
cycle is catalyzed by the same enzymes as the 
VAZ cycle (24) and similarly contributes to 
the photoprotection mechanism, but is typi- 
cally present only in trace amounts in those 
plant species examined (24-26). Here, over- 
expression of VPZ genes increased lutein epo- 
xide levels relative to WT with a concomitant 
decrease of lutein and an average reduction of 
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5% in DES related to the lutein epoxide cycle 
(LxL DES; fig.S18). Introduction of the LxL 
cycle in Arabidopsis generated plants that had 
increased photosynthetic efficiency over the 
WT (27). Thus, activation of the LxL cycle in 
the VPZ soybean could potentially contribute 
to the phenotypes observed in the VPZ trans- 
genic lines. However, epoxidation of lutein is 
significantly slower than that of zeaxanthin 
(28, 29) and would not explain the observed 
faster relaxation of NPQ. 

Changes in photosynthetic efficiency were 
evaluated through the response of photosyn- 
thesis to different light levels under steady- 
state and fluctuating light conditions (Fig. 3). 
At steady state, there were no differences be- 
tween transgenic lines and WT in quantum 
efficiencies of CO. assimilation (®CO,) or of 
linear electron transport (®PSIJ) in 2020 (Fig. 
3, A and B) and 2021 (Fig. 3, D and E). Under 
fluctuating light, CO, was significantly in- 
creased by 1 to 10% relative to WT across the 


two years (Fig. 3, G and J). In parallel, PSII 


in fluctuating light was increased 2.5 to 11.5% 
(Fig. 3, H and K). These increases were con- 
sistent with those observed previously in to- 
bacco (7/4). Consistent with the increase in 
@®CO, and PSII under fluctuating light, NPQ 
values in these fluctuating light conditions 
averaged 21% lower in 2020 (Fig. 31) and 17% 
lower in 2021 (Fig. 3L) relative to WT. In- 
creased photosynthetic efficiency did not alter 
plant growth, plant development, or stem bio- 
mass (figs. S19 to S21). Carbon and nitrogen 
ratio (C/N) and phosphorus levels in leaves 
during vegetative and reproductive stages or 
in seeds were not different between WT and 
transgenic lines (fig. S22), suggesting that the 
increase in photosynthetic efficiency did not 
alter nutrient contents per unit mass. 


Weather may affect the yield response 


So, how did the faster NPQ relaxation and 
increased photosynthetic efficiency promoted 
by the overexpression of VPZ genes lead to a 
higher seed production in 2020? Accelerating 
the rates of NPQ relaxation can only benefit 
productivity when there are fluctuations in 
light (Fig. 3) (74). Thus, it is expected that VPZ 
plants exposed to a larger number of sun-to- 
shade transitions during their growth will 
have an advantage over plants exposed to a 
lower number of those transitions. As evi- 
denced by our 2020 field season findings, the 
VPZ lines significantly outperformed WT 
while yield gains were lacking in 2021. Strong 
interannual variation in seed production has 
been observed in other transgenic yield traits 
(30), so what may explain the difference be- 
tween years in our study? First, in 2021, a se- 
vere storm just before the critical stage of seed 
filling lodged the crop, causing upper leaves 
to lie on lower leaves in the canopy, leading 
many to be permanently shaded and so, lack- 
ing the sunflecking that would have occurred 
in an upright canopy. Second, sun-shade tran- 
sitions caused by intermittent cloud cover and 
of the duration in which photosynthesis would 
benefit from the bioengineered increase in 
NPQ relaxation were 20% fewer in 2021 com- 
pared to 2020 (fig. S23). 

The number of pods produced by VPZ plants 
shows further evidence that the leaves of the 
middle canopy are important for the observed 
increase in yield in VPZ soybean. The number 
of pods in soybean is determined during the 
period from beginning of flowering to the be- 
ginning of seed filling (32) and shows a linear 
and positive correlation with canopy photo- 
synthesis (32, 33). At harvest in 2021, trans- 
genic plants had on average 13% more pods 
than WT (fig. S24F), indicating that prior to 
the lodging, VPZ soybeans were on track for 
a higher seed yield than WT plants in terms of 
pod numbers. However, at harvest, the num- 
ber of pods with four seeds, typically the high- 
est number, was lower in the transgenic lines 
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compared to WT, whereas the number of 
0-seed and 2-seed pods was higher (fig. S24, A, 
C, and E). Although the number of seeds per 
pod in soybean has high heritability (34), re- 
duced number of seeds per pod and increased 
abortion occur when photosynthate availabil- 
ity becomes limiting (35, 36). 

In conclusion, the overexpression of AfVDE, 
AtPsbS, and AtZEP in soybean promoted ac- 
celeration of NPQ relaxation with a concomi- 
tant improvement in photosynthetic efficiency 
under fluctuating light over 2 years in field 
trials. By accelerating NPQ relaxation, an aver- 
age seed yield increase of 24.5% across five 
independent events was observed in 2020. 
This is equivalent to the average biomass in- 
crease achieved previously for tobacco plants. 
This demonstrates that under field conditions, 
direct bioengineering of increased photosyn- 
thetic efficiency leads to increased yield in 
replicated plots in a major food and feed crop. 
Further, in line with agronomic practice, no 
nitrogen fertilizer was added to the soybean 
crop. Yet, more seed was produced without 
any reduction in protein, nitrogen, and oil con- 
tent, showing this as a means to the sustainable 
increases in yield urgently needed to help en- 
sure future food security. 
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THERMOELECTRICS 


Flexible thermoelectrics based on 


ductile semiconductors 


Qingyu Yang*+, Shiqi Yang’7+, Pengfei Qiu’?*, Liming Peng’, Tian-Ran Wei‘, Zhen Zhang”, 


Xun Shi??*, Lidong Chen* 


Flexible thermoelectrics provide a different solution for developing portable and sustainable flexible 
power supplies. The discovery of silver sulfide—based ductile semiconductors has driven a shift in the 
potential for flexible thermoelectrics, but the lack of good p-type ductile thermoelectric materials has 
restricted the reality of fabricating conventional cross-plane -shaped flexible devices. We report a 
series of high-performance p-type ductile thermoelectric materials based on the composition- 
performance phase diagram in AgCu(Se,S,Te) pseudoternary solid solutions, with high figure-of-merit 
values (0.45 at 300 kelvin and 0.68 at 340 kelvin) compared with other flexible thermoelectric 
materials. We further demonstrate thin and flexible x-shaped devices with a maximum normalized 
power density that reaches 30 pW cm~ K~. This output is promising for the use of flexible 


thermoelectrics in wearable electronics. 


earable electronics are widely used in 
communication, medicine, health care, 
and other areas, but their usage is 
greatly limited by the low capacity 
and short lifetime of power supply 
from chemical batteries (7). Therefore, portable, 
reliable, super-thin, and sustainable flexible 
power supplies are in great demand (2-4). 
Flexible thermoelectric (TE) generators have 
the advantages of small volume, no moving 
parts, continuously working in all weather 


conditions, and high reliability. They can di- 
rectly convert the heat from the human body 
to electricity with a temperature gradient of 
several kelvins (or less) through the Seebeck 
effect, providing in principle a self-power 
supply solution for wearable electronics (5-7). 
The performance of flexible TE devices (Fig. 1A), 
which is usually characterized by the normalized 
power density (Pmax/AAT’, where Pyax is the 
maximum power output, A is the cross-sectional 
area, and AT is the temperature difference 
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across the device) (8), is strongly dependent on 
the TE figure of merit (7) of n- and p-type TE 
materials, as well as device integration tech- 
nology. Currently, three typical approaches are 
used to fabricate high-performance flexible 
thermoelectrics. The first approach is to de- 
posit classic brittle, inorganic, thin-film TE 
materials such as Bi,Te3-based alloys on 
flexible substrates or scaffolds (9-11). However, 
because of the difficulty in achieving dense 
films and/or in precisely controlling the che- 
mical compositions, the material’s zT ob- 
tained by this approach is usually low. This 
leads to a small normalized power density 
(Fig. 1B). The second approach is directly 
using organic TE materials to fabricate flex- 
ible devices (72). Unfortunately, because of 
their poor electrical transport, the z7s of 
organic TE materials (Fig. 1C), particularly 
for n-type materials (13, 14), are much lower 
than those for state-of-the-art inorganic TE 
materials. This also leads to very low nor- 
malized power density, on the order of 10° to 
10° uW cm K> (Fig. 1B) (15, 16). 

Recently discovered ductile TE semicon- 
ductors provide a third avenue for high- 
performance flexible thermoelectrics because 
of their excellent metal-like machinability. 
Inorganic Ag»S-based materials and InSe single 
crystals with good ductility and tunable electri- 
cal properties have been reported (17, 78). After 
this development, Ag,(Se,Te,S)-based materials 
with both high g7 and inherent ductility were 
developed, pushing the zT value of n-type 
ductile TE materials to 0.44 at room temper- 
ature, which is among the highest values in 
flexible TE materials (Fig. 1C). However, the 
current 27 value of p-type ductile TE materials 
at room temperature is still very poor (0.01 at 
300 K) (19). Thus, the fabrication of flexible TE 
devices based on ductile TE semiconductors 
with the conventional, and often most practi- 
cal, cross-plane m-shaped structure (Fig. 1A) are 
quite challenging because of the lack of good 
p-type ductile TE materials. We report a series 
of high-performance p-type ductile TE mate- 
rials, AgCu(Se,S,Te) pseudoternary solid sol- 
utions, with a gT of 0.45 at 300 K and 0.68 at 
340 K, attractive values for flexible TE mate- 
rials (Fig. 1C). We successfully developed the 
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flexible 1-shaped TE devices (0.3 mm thick) 
based on inorganic materials, with a maximum 
normalized power density up to 30 uW em? K-?. 
This value is about four times that of the 
Bi,Te,-based TE devices (20-22), at least four 
orders of magnitude higher than that of organic- 
based flexible TE devices (Fig. 1B) (73-15), and 
one order of magnitude higher than that of 
other low-grade heat-to-electricity conversion 
technologies (8, 23, 24). 

The phase diagram and chemical defects are 
quite complicated in the (Ag, Cu)-X (X = S, Se, 
Te) family, which includes Ag,X, AgCuX, and 
their derivatives, and also provides an op- 
portunity for tuning their transport and 
mechanical properties through composition and 
structure modulation. Generally, Ag-contained 
compounds are n-type, whereas Cu-contained 
ones are p-type; a high S content or a high Ag 
content tends to result in ductile materials, 
whereas high Se and Te contents or high Cu 
content are prone to be brittle materials (79). 
We focused on AgCu(Se,S,Te) pseudoternary 
solid solutions in an attempt to find high- 
performance p-type ductile TE materials. We 
use a composition-performance phase diagram 
to show the relationship that we obtained ex- 
perimentally (Fig. 2). Additional details on the 
crystal structures and phase compositions of 


AgCu(Se,S,Te) pseudoternary solid solutions 
can be found in the supplementary materials 
and in figs. S1 to S3. AgCuSe is brittle and shows 
n-type conduction, with a maximum bending 
deformation of ~3% in the three-point bending 
test (fig. S4A). Alloying Te in AgCuSe does not 
change the brittle feature (fig. S4A) but does 
convert the electrical conduction to p-type. The 
boundary of p/n conductions in AgCu(Se,_,Te,) 
is ~y = 0.5 (fig. S5). Upon further alloying a 
tiny amount of S in AgCu(Se,_,Te,) (0.7 = y = 
0.5), a “brittle-ductile” transition occurred, 
and we obtained ductile p-type materials in 
AgCu(Sey-y.~52Te,) with x between 0.05 and 
0.08 and y between 0.5 and 0.7 (Fig. 2). 

AgCuSeo3.257leo.7 (v = 0.06 and 0.08) are 
ductile and plastic materials, as determined 
with mechanical tests (Fig. 3, A and B). Room 
temperature compression and three-point bend- 
ing tests showed strain values of >30% and 
10%, respectively, which are comparable to 
ductile Ag.S and its derivates (17, 25, 26). We 
also observed a similar metal-like ductility in 
other AgCu(Se,.,..5,1e,) pseudoternary solid 
solutions over a broad composition range, 
such as AgCuSeg5..S,le€o5 (@ = 0.05 and 0.07), 
AgCuSeo 35S0.051 €0.6 (fig. S4B), and cation- 
deficient (AgCu);.sS€9.2250.0sl€o.7 (6 a 0.002) 
(Fig. 3, A and B). 
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Fig. 1. Flexible TE materials and devices. (A) Schematics of flexible thermoelectrics that could be used in 
wearable electronics with a cross-plane n-shaped structure. (B) Normalized power density (Pmax/AAT°) of 
flexible TE devices achieved by using ductile materials with different filling factors (f). Data for BisTe3 
film-based devices, organic material-based devices, and traditional BizTe3—based bulk devices are included 
for comparison. (C) Room-temperature thermoelectric figure of merit (z7) and Seebeck coefficient (c) for 
ductile inorganic materials and organic-based materials. The detailed data in (B) and (C) and the related 


references are listed in tables S1 and S2. 
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We used AgCuSeg.22So.981€o,7, Which has the 
highest zT among all of the fabricated p-type 
materials tested (Fig. 2), as the typical exam- 
ple to perform the detailed microstructure 
characterization. After compression tests, we 
observed obvious slip bands along different 
directions on the material’s surface, indicating 
the existence of multiple slip bands (Fig. 3C). 
Inside each slip band, we observed numerous 
finer slip lines (fig. S6A). The space between 
the nearby parallel slip lines was dozens of 
nanometers. In addition, numerous dimples 
with sizes ranging from several hundred nano- 
meters to several micrometers appeared on 
the fracture surface of AgCuSeg 22So9.9sT€o.7 
(Fig. 3D), which was different from the river- 
like patterns on the fracture surface of brittle 
AgCuSeo 3Teo.7 (fig. S6B). Similar to many 
metals, these dimples are considered to be 
produced by the nucleation, growth, and ag- 
gregation of the microscopic cavity during 
plastic deformation (27, 28). Because of their 
good ductility, the AgCu(Se,S,Te) materials can 
be easily machined to flexible films with a 
thickness down to 100 um (Fig. 3E). 

The good ductility of AgCuSeg 3..5,Teo.7 (@ = 
0.06 and 0.08) comes from the alloyed S, even 
though the content is very low. The engineer- 
ing strain was just 3% for S-free AgCuSeg 3Teg.7 
in the three-point bending test, but was sub- 
stantially enhanced to ~13 and 18% by alloying 
S contents of 0.06 and 0.08, respectively (Fig. 
3F). We observed a similar phenomenon in 
S-alloyed AgCuSe, ;Teo; (fig. S4, A and B). To 
understand the underlying reason for this 
sort of brittle-ductile transition, we calculated 
the slipping energy barrier (4) and cleavage 
energy (E,) of S/Te-alloyed AgCuSe. The &, 
along the (010)[100] slip system was almost 
unchanged upon alloying either S or Te; how- 
ever, the EF, was obviously increased upon 
alloying S but decreased upon alloying Te 
(fig. S7). The enhanced ratio of E,/E, caused 
by the increased amount of multicentered 
and diffused Ag-S bonds (17) is beneficial for 
the material’s slip before fracture. 

Beyond the attractive ductility, AgCuSeo 3.» 
S,Teo.7 (@ = 0.06 and 0.08) showed p-type 
conduction with relatively good TE properties. 
We measured a positive Seebeck coefficient o 
(fig. S8A) and Hall coefficients Ry, (fig. S8B), 
indicating the p-type conduction behavior. 
Upon alloying S, the a was enhanced but the 
electrical conductivity o was lowered (fig. S8, 
A and C) because of the lowered hole concen- 
tration py (fig. S8D). Because the Ag-S bond is 
more ionic than Ag-Se and Ag-Te bonds, the 
Cu-deficiency defects were greatly suppressed 
when alloying S, leading to the decreased re- 
duced Fermi energy (fig. SSE) and hole con- 
centrations. Likewise, both carrier thermal 
conductivity (kc) and lattice thermal conduc- 
tivity (k,) were also reduced upon alloying S 
(fig. S8F), yielding the lowered thermal con- 
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Fig. 2. Composition- 4 i 

performance phase dia- 05 
gram of AgCuSe-AgCuS- 0.4 
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solid solutions. Columns 
represent zT at room tem- 
perature. Blue and red 0.1 
symbols represent n- and 
p-type conduction, respec- 
tively. Filled circles repre- 
sent ductile compositions 
and hollow circles brittle 
compositions. 


ductivity « (fig. S8G). The power factors (PF = 
ao) of AgCuSeo3..5,1eo7 (v = 0.06 and 0.08) 
were ~3 u.W cm” K® (fig. SSH), whereas the 
thermal conductivities « (fig. S8G) were only 
~0.3 to 0.4 Wm’ Kt leading to a 2T of up to 
0.31 at 300 K (Fig. 3G). Upon further intro- 
ducing a tiny amount of cation deficiency, we 
were able to enhance the PF to 5.1uW cm K~? 
at 300 K (fig. S8H). Although the « of cation 
deficient material was slightly enhanced be- 
cause of the increased kc (fig. S8, F and G), the 
2T was still improved to 0.45 at 300 K and 0.68 
at 340 K (Fig. 3G) for (AgCu)o 9985€0.22S0.0sT €o7, 
respectively. Likewise, good reproducibility of 
TE properties was observed in our ductile TE 
materials (fig. S9). 

The g7 values of our ductile semiconduc- 
tors were much higher than those of organic 
materials (Fig. 1C), even close to the commer- 
cial zone-melted Bi,Te3-based alloys. These 
high g7T values can be mainly attributed to 
their ultralow «x; originating from the severe 
atomic-lattice disorders caused by the com- 
plex chemical compositions (29). The « of duc- 
tile AgCuSeo 3..S,Teo7 (v = 0.06 and 0.08) was 
much lower than that of the classic Bi:Te3- 
based materials (1.0 to 1.8 Wm! K™! at 300 K) 
(21), and was even comparable to that of wood 
(0.1 to 0.3 Wm’ K“?) (80). Such an ultralow 
« is helpful for realizing a large temperature 
gradient across a TE device. In addition, ductile 
TE materials have larger o and PF than the 
organic TE materials (Fig. 3H), which is bene- 
ficial for achieving high onset voltage with 
fewer p/n couples in TE devices. 

The discovery of high-performance n- and 
p-type ductile TE materials offers the opportu- 
nity to fabricate flexible cross-plane m-shaped TE 
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devices (Fig. 4A). We selected p-type (AgCu)o.99g 
Seo .225o,0sT€o,7 and n-type AgooS7Tes (Fig. 1C) 
to connect electrically in series and thermally 
in parallel between two polyimide (PI)-based 
flexible circuit boards. Taking advantage of 
the materials’ ductility, we cut n- and p-type 
ultra-thin plates with a thickness of 0.1 mm 
directly from bulk ingots. The intrinsic brittle- 
ness of traditional thermoelectric alloys such 
as the Bi,Te3-based ones prevents fabricat- 
ing a similar thickness in these materials. 
We metallized the thin plates by magnetron 
sputtering using tungsten (W) and tin (Sn) 
as contact layers with an electrical contact 
resistivity (p,) of 19 and 2 1.Q cm? for Sn/W/ 
(AgCu)o 998S€0.22So.0sT€o,7 and Sn/W/Ag20S7Tes 
interfaces (Fig. 4B), respectively. These small 
Pc values, as well as the large thermal con- 
ductivities of W and Sn, suggest that the ad- 
ditional thermal resistance introduced by the 
contact layer was very low. The metalized thin 
films were cut into the assigned shapes and 
then soldered onto the Cu/Au electrodes in 
PI-based flexible circuit boards using a low- 
temperature solder paste (fig. SIOA). 

We successfully fabricated a few flexible 
cross-plane m-shaped TE devices with six-couple 
(AgCu)o.998S€o.22S0.08T €0.7/A20S7Tes legs (Fig. 
4C). We designed the same dimensions of TE 
legs but different filling factors (f) with the 
values of 38, 55, and 72%. The total thickness 
of our flexible TE was only 0.3 mm with an 
upper limit of several millimeters, like a normal 
rigid device. Therefore, the thickness of our 
flexible device can be freely tuned from 0.3 mm 
to a few millimeters. We characterized the 
performance of these devices using a home- 
made test platform (fig. SIOB). We show the 
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output voltage (V) and output power (P) asa 
function of current (J) under different AT for 
the device with f= 72% in Fig. 4D. The device 
internal resistance (Rj,) was ~18 mQ, which 
is comparable to the theoretical value (19 mQ) 
calculated on the basis of the material’s elec- 
trical resistance. This R;, was at least three 
orders of magnitude lower than that of flexible 
organic material-based TE devices (14, 15) and 
two orders of magnitude lower than that of 
classic bulk Bi,Te3-based TE devices (fig. S11). 
Such low Rj, caused by the super-thin TE legs, 
is beneficial for achieving super-high output 
power P. The open circuit voltage (Voc) and 
maximum output power (Pmax = Voc?/4Rin) 
increased with increasing AT, reaching 3.7 mV 
and 203 uW at AT =1.5 K (Fig. 4D), respec- 
tively. The maximum normalized power den- 
sity Pmax/AAT’? was ~30 uW cm” K~, a high 
value compared with other reported flexible 
TE devices and even bulk TE devices (Fig. 1B) 
(14-16, 20-22). 

We also tested the performance of the de- 
vices with f = 38% and f= 55% (fig. S12, A and 
B). Under similar AT, the device with higher 
f had higher Vo¢/A and Pyy,;/AAT ? (fig. S12, 
Cand D). The Pmax/AAT” values were ~13 and 
20 uW cm” K~? when f = 38 and 55%, respec- 
tively, still much higher than that of current 
TE devices (Fig. 1B) (14-16, 20-22). Such 
ultrahigh performances are mainly caused by 
the material’s high TE figure of merit, short 
length of TE legs, and low energy loss at the 
material/electrode interfaces. 

The super-thin and highly efficient flexible 
TE devices developed in this study can be 
implemented into the Internet of Things, 
quartz watches, or motion detectors, acting 
as a charge-free battery to monitor human 
health or the environment. Toward these 
potential applications, we fabricated a cross- 
plane n-shaped TE device including 31-couple 
(AgCu)o.998S€0.22S0.08T€0.7/A820S7Tes with f = 
33% (Fig. 4C, right). This device can easily and 
effectively adhere to the curved surface of 
human skin using adhesive tape (Fig. 4E). 
Under an ambient temperature of 298 K and 
humidity of 42%, directly adhering the device 
on a human’s wrist gave a Voc and Pymax of 
0.2 mV and 70 nW, respectively (fig. SI3A). The 
Prmax/AAT’ of this device was ~11 .W em K~, 
the same as for the six-couple device with a 
similar filling factor. By shaking the arms to 
strengthen the convective heat transfer, the 
AT across the TE device could be enhanced 
to 0.1 K to yield a Voc and Pmax of 0.76 mV 
and 1 uW, respectively (fig. S13B). Spraying 
alcohol on the device’s upper surface could 
cause a large AT (0.35 K), yielding a high Voc 
and Pymax of 2.5 mV and 13 uW, respectively 
(fig. S13C). In real applications, various heat- 
dissipation approaches (e.g., copper pipes and 
graphite sheets) will be applied to further en- 
hance AT to 1 to 2 K to reach the threshold 
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Fig. 3. Mechanical and thermoelectric properties of (AgCu);.sSeo,3..S,T€po,7- (A and B) Engineering 
stress-strain curves for compression (A) and bending (B) tests for (AgCu);-sSeo0.3-xSxTeo,7 (x = 0, 0.04, 
0.06, and 0.08; 6 = 0 and 0.002) at room temperature. Data for AgoS and its n-type derivates are included 
for comparison (17, 25, 26). (© and D) SEM images of the surface (C) and fracture surface (D) of 
AgCuSe€o.22So.0gl€o7 after the compression test. (E) Optical image of the AgCuSep.22So.9gT€o7 thin film 
showing good flexibility. (F) Maximum engineering strains of bending test as a function of S content for 
AgCuSeo5-xSxTeo.5 and AgCuSe€o,3.xS,Teo,7. (G@) Temperature dependence of TE zT for (AgCu)-sS€03-xSy1€07 
(x = 0, 0.06, and 0.08; 6 = 0 and 0.002). (H) Absolute value of Seebeck coefficient (a) as a function 

of electrical conductivity (c) for p-type AgCu(Se,S,Te) ductile TE materials. Data for n-type Ag2S-based 
ductile materials and typical organic-based TE materials are included for comparison. The detailed data in (H) 
and the related references are listed in table S1. 
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Fig. 4. Super-thin flexible TE devices based on ductile semiconductors. (A) Structure of the m-shaped 
flexible (AgCU)o,99gS€o.22S0,081 €0,.7/AZ29S7Te3 TE device. The bottom panel is the SEM image for the overall 
cross-section of the flexible device with the thicknesses of each layer labeled. (B) Resistance (R) line 
scanning across the interface for the p- and n-type TE elements. (C) Optical images of the as-prepared 
six-couple flexible TE devices with different filling factors (f = 38, 55, and 72%) and 31-couple flexible TE 
devices (f = 33%). (D) Measured output voltage (V) and output power (P) as a function of current (/) under 
different temperature differences for the six-couple (AgCU)o.998S€0,22S0.081 €0.7/A820S71e3 flexible TE 
device with f = 72%. (E) Optical image of the 31-couple flexible TE device adhered on a human's wrist. 


of 100 uW, which is strong enough to power 
many microelectronics such as quartz watches 
and wireless sensor nodes. Furthermore, the 
total thickness of our flexible device can be 
freely tuned for various working conditions 
with different temperature gradients and 
working spaces. 

We also tested the service stability of our 
flexible TE device by cyclically bending the 
31-couple device with a filling factor of 33% 
under a bending radius of 15 mm. After bend- 
ing 500 times, the relative inner resistance 
(Rin/Rino) Was scarcely changed (fig. S13D). 
A service stability test was also performed in 
two long-strip-shaped devices with high filling 
factors of 55 and 71%. The maximum incre- 
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ment of Rip/Rino was <5% after bending 500 
times (fig. S13, E to G). These tests indicate 
that the service stability of our device is ac- 
ceptable. Furthermore, we found that humid- 
ity had little influence on the Voc and Pax Of 
the device (fig. S13H). 

In summary, we have discovered p-type 
ductile TE materials with a relatively high TE 
figure of merit compared with other flexible 
materials. Coupled with high-performance 
n-type ductile materials, we successfully devel- 
oped super-thin flexible TE devices with a 
conventional r-shape, ultra-high normalized 
power density, and reasonable service sta- 
bility. Ductile semiconductors thus provide a 
different strategy for high-performance flex- 


ible thermoelectrics, which can directly and 
efficiently convert low-grade heat energy such 
as the heat from human body into useful and 
sustainable electricity. Our study offers a 
promising example of applying a self-powered 
supply technology to wearable electronics. 
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FLEXIBLE ELECTRONICS 


Chip-less wireless electronic skins by remote 
epitaxial freestanding compound semiconductors 
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1,2,19y 


Recent advances in flexible and stretchable electronics have led to a surge of electronic skin (e-skin)- 
based health monitoring platforms. Conventional wireless e-skins rely on rigid integrated circuit chips that 
compromise the overall flexibility and consume considerable power. Chip-less wireless e-skins based on 
inductor-capacitor resonators are limited to mechanical sensors with low sensitivities. We report a chip- 
less wireless e-skin based on surface acoustic wave sensors made of freestanding ultrathin single- 
crystalline piezoelectric gallium nitride membranes. Surface acoustic wave—based e-skin offers highly 
sensitive, low-power, and long-term sensing of strain, ultraviolet light, and ion concentrations in sweat. We 
demonstrate weeklong monitoring of pulse. These results present routes to inexpensive and versatile 
low-power, high-sensitivity platforms for wireless health monitoring devices. 


lectronic skin (e-skin)-based health moni- 

toring platforms have recently emerged, 

including flexible and stretchable sen- 

sors (7), electronic circuits (2), and skin- 

compatible adhesive patches (3, 4) that 
laminate conformally onto the skin. Their ap- 
plications include biophysical tracking of 
fitness and wellness (5), as well as clinical 
management (4, 6). A key technical require- 
ment for the broader use of e-skins in our daily 
lives is the capability to communicate data 
wirelessly. A major issue in conventional wire- 
less e-skins is that they require rigid integrated 
circuit chips—near-field communication (NFC) 
or radio frequency identification (RFID) chips, 
microprocessors, or analog-to-digital convert- 
ers (ADCs), for example—which can compro- 
mise overall flexibility. Moreover, given the 
power constraint in wireless e-skin systems, 
the high power consumption by these chips, 
which contain thousands of transistors (7), often 
leads to reduced sensitivity (due to the power- 
sensitivity trade-off in ADCs) (8), substantial 
heat generation (9), and reduced communi- 
cation distance (10). To address these issues, 
inductor-capacitor (LC) resonator-based chip- 
less wireless e-skin sensors have been studied 
(11), but demonstrated applications are re- 


stricted to strain and pressure sensing with 
relatively low sensitivity owing to the limita- 
tions of capacitive sensor designs. 

We report a chip-less wireless e-skin tech- 
nology based on surface acoustic wave (SAW) 
sensors that achieves marked improvement in 
strain sensitivity, power efficiency, versatility 
as a sensing platform for a broad range of ex- 
ternal stimuli, and long-term wearability. This 
is achieved by integrating a freestanding ultra- 
thin single-crystalline piezoelectric membrane 
of gallium nitride (GaN) on a flexible patch 
as the material for passive wireless sensing. 
The ultrathin GaN epitaxial layers are grown 
on graphene-coated GaN substrates by re- 
mote homoepitaxy and are easily released 
from the weak graphene-GaN interface by a 
two-dimensional (2D) material-based layer 
transfer (2DLT) process (12). 

Figure 1A and fig. S1 show schematic illus- 
trations comparing conventional chip-based 
wireless e-skins and our SAW-based chip-less 
wireless e-skin (see table S1 for additional de- 
tails). Chip-based wireless e-skins integrate 
NFC or RFID chips, circuit elements (diodes, 
resistors, and capacitors), sensors, and radio 
frequency antenna on silicone patches (7). The 
large size and rigidity of these chips compro- 


mise conformal lamination of e-skin on skin 
and necessitate thick silicone patches (300 to 
500 um), which disturb skin function over time 
owing to sweat impermeability and occlusion. 
Moreover, they consume substantial power. 
In our SAW-based chip-less wireless e-skin, 
an ultrathin GaN SAW sensor replaces the 
chips, circuit components, and sensors. The 
changes in the resonant frequency of the GaN 
SAW sensor yield information about mechan- 
ical, optical, and biochemical stimuli (see 
supplementary text in the supplementary 
materials for the detailed mechanism). The 
ultrathin and light functional layers (GaN and 
metal electrodes) can be integrated on a thin 
patch of polydimethylsiloxane (PDMS; ~20 um) 
with perforations that allow removal of sweat 
and skin by-products (73), thereby improving 
the softness, skin-conformability, and long-term 
wearability compared with chip-based e-skins, 
while also consuming considerably less power. 
Wireless SAW sensors have traditionally re- 
lied on thick, non-freestanding GaN films that 
are tethered to the wafer, which were difficult 
to integrate on flexible e-skins. We estimated 
that the GaN film thickness required for con- 
formal skin adhesion is 300 nm (Fig. 1B; see 
supplementary materials for detailed calcula- 
tion). Remote epitaxy can yield single-crystalline 
GaN membranes with a thickness of <300 nm 
and reduce material cost for GaN films (72). 
We successfully performed the remote epitaxy 
of 200-nm-thick GaN membrane on graphene- 
coated GaN substrates, followed by exfoliation 
via 2DLT (fig. $2). High-resolution x-ray dif- 
fraction (HRXRD) phi scan data in Fig. 1C 
shows the perfect single-crystallinity of GaN 
on graphene. Figure 1D shows cross-sectional 
transmission electron microscopy (TEM) and 
associated selective-area diffraction images of 
the exfoliated GaN membrane on metal sup- 
port layers indicating solely oriented atomic 
planes of GaN nanomembrane. Atomic force 
microscopy analysis (fig. $3) also confirms 
the atomically flat surface of GaN. These 
properties are important for high-performance 
electromechanical and optoelectronic devices. 
We further verified the capability of our GaN 
membranes to laminate conformally on skin by 
attaching them on a skin replica made of Eco- 
flex silicone [see methods for preparation steps 
and Fig. 1E for scanning electron microscopy 
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Fig. 1. Chip-less wireless e-skin based on single-crystalline freestanding nano- 
membranes of GaN and its mechanical properties. (A) Comparison between 
(left) conventional wireless e-skin based on integrated circuit chips and (right) our 
chip-less wireless e-skin based on surface acoustic wave (SAW) devices made of 
GaN freestanding membranes. (B) Estimation of minimum GaN thickness required 
for conformal lamination on human skin. Strain energies on curved surfaces per 
area depending on GaN thickness and the work of adhesion of a PDMS skin patch 


(SEM) images]. We integrated our resulting 
ultrathin GaN membranes into our e-skin (figs. 
$2 and S4 to S8; see methods for more details). 

To study the optoelectronic performance of 
our GaN nanomembrane device, we fabricated 
GaN-metal Schottky junction diodes using 
200-nm-thick membranes of single-crystalline 
GaN, obtained by means of remote epitaxy 
and 2DLT, and polycrystalline GaN, obtained 
on poor-growth substrates, and compared their 
ultraviolet (UV) sensitivities. UV radiation is 
an important health indicator that existing 
chip-less wireless e-skins based on an LC reso- 
nator lack the semiconductor component to 
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Highly stretchable and deformable 
— Good conformability and breathability 


detect. Our UV sensing results (Fig. 2A) in- 
dicate a UV responsivity of 10.06 A W! at 
15.90 mW/cm? for a single-crystalline GaN 
diode, which is 37-fold higher than that of a 
polycrystalline GaN diode. This responsivity is 
also higher than those reported for e-skin sen- 
sors based on polycrystalline inorganic mate- 
rials (14) or single-crystalline Si, which has 
poor optical absorption in the UV wavelength 
regimes (75), and comparable to that of the 
state-of-the-art single-crystalline GaN UV sen- 
sors on “rigid” wafers (16). A single-crystalline 
GaN diode also exhibits higher UV sensing 
speed, recovery time, and selectivity (for UV 
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were calculated and compared. (C) HRXRD phi scan of 200-nm-thick remote 
epitaxial GaN film. a.u., arbitrary units. (D) TEM image and associated selective-area 
diffraction images of the freestanding 200-nm-thick GaN nanomembrane. (E) SEM 
images of GaN e-skins with (left) 1800-nm-thick and (right) 200-nm-thick GaN 
attached to skin replica samples. The 1800-nm-thick GaN film (left) cannot fully 
conform to the microscale features of skin, whereas the 200-nm-thick film (right) 
can deform to follow the pits and curvatures. 


light over white light) than those of a poly- 
crystalline GaN diode, as illustrated in Fig. 2, 
Band C. 

The excellent piezoelectricity and perfect 
single-crystallinity of our ultrathin GaN film 
enable wireless communication without NFC 
or RFID chips. We conceived our chip-less 
wireless e-skin by (i) discovering the formation 
of highly sensitive SAWs on freestanding 
ultrathin single-crystalline piezoelectric GaN 
membranes and (ii) reinventing the free- 
standing SAW device as a biosensing plat- 
form. Upon successful lamination on the 
human body, our e-skin can generate SAW 
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Fig. 2. High optoelectronic and surface acoustic performance of GaN-based 
single-crystalline e-skins. (A to C) GaN-metal Schottky diode UV sensors made of 
single-crystalline and polycrystalline GaN nanomembranes. (A) Responsivity of 
GaN UV sensors. The changes in photocurrent of GaN diodes in response to UV 
exposure with varying intensities. (B) Response and recovery speeds and selectivities 
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GaN UV sensors. The inset shows emission spectra of UV light and white light. 
(C) Consistency of UV response of GaN UV sensors. Slow switching speed of 
polycrystalline GaN sensors induces a gradual increase in remnant photoconductance. 
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Fig. 3. Wireless GaN SAW strain sensors on e-skins free of communica- 
tion chips or power source. (A) Schematic illustration and optical images of 
our wireless GaN SAW e-skin strain sensor. R1, R2, R3, and R4 in insets are 
120, 50, 20, and 10 um, respectively. Scale bars, 200 um. (B) Resonant 
frequency shift as a function of strain for GaN SAW e-skin strain sensors. 


that responds sensitively to physiological ac- 
tivities, the changes of which can be detected 
wirelessly through the antenna. Although pre- 
vious studies note that SAW generated by an 
ultrathin piezoelectric film grown on a sap- 
phire substrate is relatively negligible owing 
to the strong binding of the film to the sub- 
strate (17, 18), we have discovered that highly 
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crystalline GaN films that are declamped from 
the substrate can generate SAWs with an 
ideal electromechanical coupling value. 
Simulation of SAW generation in the GaN 
membrane (Fig. 2, D and E) showed that the 
electromechanical coupling coefficient k’ is 
negligible for substrate-bound GaN films that 
are thinner than 300 nm (at a resonant fre- 
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(C) Measurement of the minimum detectable strain of our GaN SAW e-skin 
strain sensors. (D) Benchmark of minimum detectable strain and energy 
consumption for battery-less wireless e-skin strain sensors reported 
previously (10, 11, 25-28). (E) Wireless pulse measurements using our GaN 
SAW e-skin strain sensors. 


quency of ~54 MHz), because substrate binding 
alters the acoustic mode and substantially re- 
duces the displacement amplitude of acoustic 
vibration. However, declamping of the GaN 
layer from the substrate substantially enhan- 
ces electromechanical coupling by eliminating 
the binding effect. We experimentally con- 
firmed that the k* value of our freestanding 
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Fig. 4. Chip-free battery-less wireless GaN SAW ion sensors and UV sen- 
sors. (A) Schematic illustrations and optical microscope image of wireless ion 
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Scale bar, 200 um. (B) Resonant frequency shift in the wireless signals obtained 
n Na* ion concentration. 
(C) Continuous wireless recordings collected from a SAW ion sensor during 
ution and distilled water 
covery characteristics. 
sors to variation in Na* ion 
ation in 5-min continuous 
measurements of the output signal for each concentration. (E) Photograph 


from a GaN SAW ion sensor in response to changes i 


a series of alternating injections of 0.86 mM NaCl so 
over the e-skin, indicating consistent response and re’ 
(D) Calibration of the responsivity of GaN SAW ion sen 
concentration. The error bars represent standard devi 


200-nm-thick GaN obtained by 2DLT is close 
to the value of 4.09% (Fig. 2F) that was re- 
ported for a 2.0-um-thick GaN SAW sensor on 
a sapphire substrate (19). As shown in the 
benchmark plot in Fig. 2G (19-24), the k* 
value of our GaN film is on par with values 
previously achieved by epitaxial GaN SAW 
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sapphire wafers. A high k” value is essential for 
wireless detection, as it increases the signal-to- 
noise ratio in the resonant frequency shift of 
the SAW device on human skin. 

Figure 3A shows the schematic illustration 
and microscopy images of our SAW-based 
chip-less wireless e-skin system. The GaN 
SAW device on a PDMS skin patch is designed 


2000 
Time (s) 


1000 


— in vivo (e-skin) 
° reference 


500 1000 


Time (s) 


1500 


and microscopy image of an e-skin attached on the back of a hand (top), in vivo 
recordings of the variation in Na* ion concentration in sweat using wireless GaN 
SAW-based e-skin (blue) and a reference conductometer (red; middle), and 

in vivo recording of the skin temperature obtained by a commercially available 
thermometer (bottom). Putting a glove on the hand and placing heating pads on 
body induces sweating. Scale bar, 500 um. (F) Continuous wireless recordings 
collected by a SAW UV sensor in response to exposures of UV light (purple) and 
white light (orange), indicating the sensor's selectivity. (G) Calibration of the 


UV sensors to exposures of different UV light 
t and error bar indicate the average and standard 
m eight different measurements. 


to be freestanding and suspended in air to 
maximize device sensitivity (see supplemen- 
tary text for detailed device structures). SAW 
resonant peak frequency changes in response to 
mechanical strain and mass due to absorption 
or desorption of ions, as well UV exposure due to 
optical absorption by GaN (see supplementary 
text for detailed mechanisms). By scanning 
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the return loss (S;,) over a range of frequen- 
cies using the external wireless reader, the 
resonant frequency of GaN SAW (53.95 MHz) 
was recorded wirelessly (see figs. S9 and S10 
for details). 

We demonstrated wireless strain sensing by 
first calibrating the resonant peak shifts in the 
SAW device in response to strain induced by 
bending the patch (Fig. 3B and fig. S11). Com- 
pared with previously reported battery-less 
wireless e-skin systems (JO, 11, 25-28), our 
single-crystalline GaN-based e-skin exhibits 
higher strain sensitivity, smaller minimum 
detectable strain (0.048%), and lower power 
consumption owing to chip-less operation 
(Fig. 3, C and D, and figs. S12 and S13). Al- 
though LC resonator-based e-skins can also 
achieve low-power sensing (Fig. 3D), they suf- 
fer from low sensitivity and limited sensing 
modality outside that of mechanical stimuli 
(11, 29, 30). Their low sensitivity also neces- 
sitates highly sophisticated and costly vector 
network analyzers with low trace noise for 
wireless measurement. 

The high strain sensitivity of a SAW sensor 
on e-skin allows the continuous chip-less wire- 
less measurement of arterial pulse waves on 
the wrist. Figure 3E and movie S1 illustrate the 
continuously wireless monitoring of pulse sig- 
nals using our e-skin. Figure S14 shows the 
results of daytime (worn for ~17 hours/day) 
monitoring of heart rate and pulse waveforms 
(measured after exercise) over 7 days, which 
demonstrates the reusability and long-term 
wearability of our e-skin. Continuous wire- 
less recording of strain through bending (fig. 
$15) and stretching (10.3% strain; fig. S16) 
cycles shows that our e-skin can undergo re- 
versible bending and stretching without frac- 
turing of the GaN owing to thin, deformable 
metal interconnects (30). 

Our SAW sensor’s communication range 
is ~14 mm in the 2 direction, ~7.5 mm in the 
x direction, or a rotation of ~30° (fig. S17). The 
relatively short communication range leads 
to limited monitoring of heart rate during 
active body movement (fig. S18). Replacing the 
near-field electrically small antenna with a far- 
field loop antenna may lead to longer-distance 
wireless communication thanks to the very 
low power consumption of our SAW sensor 
(see Fig. 3D for power comparison; methods 
and fig. $13 for estimation of power consump- 
tion in SAW sensors; and supplementary text 
for wireless distance and comparison with 
chip-based e-skins). 

We also explored the wireless monitoring of 
ion concentrations in sweat, which can serve 
as indicators for conditions such as hypona- 
tremia, kidney failure, and hypertension (37), 
using our SAW-based e-skin platform. GaN 
has an extremely low dissolution rate in 
aqueous solutions and can therefore yield a 
stable, biocompatible sweat sensor. We coated 
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the surface of a GaN SAW device with ion- 
selective membranes (ISMs) that can trap 
specific ions, which induces changes in the 
viscosity and mass of ISMs (see Fig. 4A, fig. 
S19, and table $2 for the design and compo- 
sition of the SAW ion sensor). The resonant 
peak shifts in an ISM-coated SAW device can 
be used to wirelessly detect variation in ionic 
concentrations in surrounding fluids. 

Figure 4, B to D, illustrates the results of 
in vitro wireless detection of Na* ions using an 
Na* ISM-coated GaN SAW device on e-skin. 
Continuous recordings in Fig. 4, B and C, 
indicate clear and consistent response and 
recovery of the output signal, collected by ex- 
posing the e-skin to aqueous NaCl solutions 
containing varying concentrations of Na* ions 
(see methods for detailed methodology). Cal- 
ibration data in Fig. 4D, which represent the 
time-averaged response of the SAW sensor 
from 5-min continuous recordings, also indi- 
cate consistent sensor behavior. The detection 
limit of 0.86 mM represents a small value 
compared with the biologically relevant range 
of >10 mM in sweat, indicating the high sen- 
sitivity of the GaN SAW ion sensor. The sensor 
selectivity for Na* ions was confirmed by mon- 
itoring its response to flows of ionic solutions 
with different compositions (fig. S20). 

Figure 4E shows images of a GaN SAW- 
based e-skin attached on the back of a hand 
for in vivo sweat analysis (see fig. S21 for 
measurement setup). Na” ion concentration 
in sweat was monitored simultaneously by a 
SAW sensor and a reference conductometer, 
while a commercial thermometer recorded 
skin temperature. The results show that the 
concentration of Na* ions in sweat gradually 
increases as perspiration continues (5). Inte- 
gration of multiple types of ISMs on SAW de- 
vices may yield a wireless e-skin platform with 
improved versatility for sweat analysis. 

Strong optoelectronic characteristics and 
piezoelectricity of GaN also enable wireless 
detection of UV light. SAW UV sensors (see 
fig. S22 for structures) respond to UV light 
(wavelength: ~325 nm) but not to white light 
(wavelength: ~595 nm), as shown in Fig. 4F. 
The results highlight the sensor’s selectivity 
for UV owing to the matching bandgap of 
GaN, which eliminates the need for an optical 
filter and simplifies sensor design (fig. S23). 
Ion and UV (Fig. 4G) sensing capabilities ofa 
GaN SAW-based e-skin illustrate the diverse 
sensing options that can be achieved using our 
chip-less wireless platform. Figure S24 also 
illustrates our sensors’ capability to distin- 
guish different types of input stimuli. 

Here, we report a chip-free wireless e-skin 
platform based on SAW sensors made of free- 
standing ultrathin single-crystalline piezoelectric 
GaN films prepared by remote homoepitaxy 
and 2DLT to yield SAW devices with a high 
electromechanical coupling coefficient. SAW- 


based e-skins offer a versatile biomedical sen- 
sing platform with high sensitivity, low power 
consumption, and long-term wearability. 
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METALLOENZYMES 


Structures of the nitrogenase complex prepared 
under catalytic turnover conditions 


Hannah L. Rutledge, Brian D. Cook, Hoang P. M. Nguyen, Mark A. Herzik Jr.*, F. Akif Tezcan* 


The enzyme nitrogenase couples adenosine triphosphate (ATP) hydrolysis to the multielectron 
reduction of atmospheric dinitrogen into ammonia. Despite extensive research, the mechanistic 
details of ATP-dependent energy transduction and dinitrogen reduction by nitrogenase are not well 
understood, requiring new strategies to monitor its structural dynamics during catalytic action. 
Here, we report cryo-electron microscopy structures of the nitrogenase complex prepared under 
enzymatic turnover conditions. We observe that asymmetry governs all aspects of the nitrogenase 
mechanism, including ATP hydrolysis, protein-protein interactions, and catalysis. Conformational 
changes near the catalytic iron-molybdenum cofactor are correlated with the nucleotide-hydrolysis 


state of the enzyme. 


educed forms of nitrogen are essential 
for the biosynthesis of amino acids and 
nucleic acids as well as the production of 
fertilizers and many commodity chem- 
icals (7). As the only enzyme capable of 
nitrogen fixation, nitrogenase catalyzes the 
eight-electron reduction of atmospheric nitro- 
gen (N,) and protons (H*) into ammonia (NH) 
and hydrogen (Hz) (2, 3) (Fig. 1A). Nitrogenase 


is a two-component enzyme, which, in its most 
common form, consists of the iron protein FeP 
(a Y2 homodimer) and the molybdenum-iron 
protein MoFeP (an of. heterotetramer) (Fig. 
1B) (4, 5). Nitrogenase is distinct from most 
redox enzymes in its requirement for adeno- 
sine triphosphate (ATP) hydrolysis to enable 
the successive transfer of electrons and protons 
for substrate reduction (6, 7). The coupling 


of ATP hydrolysis to electron transfer (ET) 
is mediated by FeP, which forms a specific, 
nucleotide-dependent complex with MoFeP 
and hydrolyzes two ATP molecules for the 
transfer of an electron to MoFeP (4-7). This 
part of nitrogenase catalysis is termed the 
FeP cycle (Fig. 1A) (2, 8). In the MoFeP cycle 
(Fig. 1A) (2, 8), the electrons from FeP are 
received by the P-cluster (an [8Fe:7S] complex) 
of MoFeP and relayed to the iron-molybdenum 
cofactor FeMoco (a [7Fe:9S:C:Mo]-homocitrate 
complex), where N, binding and reduction 
occur. Specific ATP-dependent interactions 
between FeP and MoFeP are necessary not 
only for interprotein ET but also for gated 
ET between the P-cluster and FeMoco through 
long-distance conformational perturbations, 
the nature of which are not understood 
(6, 7). 

Although the dynamic coupling between 
ATP hydrolysis and ET ultimately drives No 
fixation, it also creates experimental chal- 
lenges for a mechanistic understanding of 
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Fig. 1. Catalytic cycle for nitrogenase and its structural characterization under 
nonturnover and turnover conditions. (A) Chemical reaction catalyzed by 
nitrogenase. There are eight FeP cycles in each MoFeP cycle. (B) Crystal structure 
(PDB ID 1M34) (10) of the 2:1 FeP:MoFeP complex stabilized by MgADP.AIF,, showing 
the relative positions of the individual FeP (y' to y*) and MoFeP (07810782) subunits, 
the nucleotides, and the metalloclusters. The FeP subunits are shown in dark green 
(y' or y°) and light green (y’ or y*), and the MoFeP subunits are highlighted in 

light blue (ot), dark blue (a2), light orange (B"), and dark orange (8°). Black arrows 
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indicate the path of electron flow. (C) Cryo-EM maps of 'SMoFeP (~1.8-A resolution) 
obtained in the absence of FeP: “°Complex-1 (~2.3-A resolution) and “°Complex-2 
(~2.3-A resolution) obtained under turnover conditions; and the BeF,-trapped FeP- 
MoFeP complex (~2.4-A resolution). Coloring scheme for the subunits is the same as 
in (B). Cryo-EM maps for nucleotides are shown as a gray mesh and contoured at 
the following levels: ‘SMoFeP, 0.008; “°Complex-1, 0.17; and “°Complex-2, 0.075. 
Magnified views of the nucleotides bound to the nitrogenase complexes and their 
corresponding cryo-EM densities are shown in boxes. 
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nitrogenase. First, substrates and inhibitors 
can only bind reduced forms of FeMoco, whose 
generation requires continuous turnover con- 
ditions that include ATP and reduced FeP 
(2, 3). Because MoFeP is also an inherent 
hydrogenase, such reduced forms of FeMoco 
promptly return to the resting state through 
H, evolution upon termination of ATP hydrol- 
ysis (8, 9), rendering substrate-bound states of 
FeMoco too fleeting for structural characteri- 
zation. Second, the necessity of continuous 
ATP hydrolysis for catalysis leads to a hetero- 
geneous distribution of nitrogenase substates 
that are difficult to interrogate experimentally. 
These substates differ not only in the FeP- 
MoFeP complexation state and the extent of 
ATP hydrolysis within the FeP cycle but also 
in the oxidation states of the metal cofactors 
and the reaction intermediates on FeMoco 
(Ep to Eg) within the MoFeP cycle (Fig. 1, A 
and B). 

Previous crystallographic studies have pro- 
vided detailed views of the nucleotide-dependent 
conformations of the FeP-MoFeP complexes 
and outlined the path of ET in nitrogenase 
(Fig. 1B) (0, 1D). Yet, these static views, which 
were obtained under nonturnover conditions 
and constrained by crystal lattice, showed no 
variation in the structure of MoFeP, providing 
little insight into the mechanism of ATP/FeP- 
mediated redox events within this protein. 
Recent crystallographic studies of MoFeP and 
its vanadium analog, VFe-protein, revealed 
possible modes of ligand interactions with 
FeMoco and showed that the cofactor is cap- 
able of undergoing compositional changes 
(12-14). However, these structural snapshots 
were obtained using crystals that formed hours 
to days after the last enzymatic turnover re- 
action, and their catalytic relevance remains 
an open question. In parallel, extensive freeze- 
quench spectroscopic studies of nitrogenase 
have characterized substrate- and intermediate- 
bound states of FeMoco (15, 16). Yet, these 
methods do not report on the atomic structure 
of the cofactor as a whole and provide little 
information regarding how the local cofactor- 
ligand interactions are linked to global ATP/ 
FeP-dependent structural dynamics of MoFeP. 
Consequently, we lack a detailed understand- 
ing of why and how ATP hydrolysis is used to 
drive N, fixation and how catalysis at FeMoco 
proceeds. Clearly, new experimental approaches 
are needed to structurally interrogate nitro- 
genase during catalysis at or near atomic 
resolution to simultaneously characterize the 
ATP-dependent FeP-MoFeP interactions and 
the associated structural dynamics within 
each protein. 

We sought to overcome some limitations of 
prior studies by using cryo-electron micros- 
copy (cryo-EM) to directly visualize nitro- 
genase under enzymatic turnover conditions. 
We prepared cryo-EM samples of Azotobacter 
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vinelandii nitrogenase under high-electron 
flux turnover conditions (see supplementary 
methods), which contained a 10-fold molar ex- 
cess of FeP over MoFeP and low ionic strength 
(<25 mM NaCl) to favor the formation of 
electrostatically driven FeP-MoFeP complexes. 
We included sufficient MgATP and dithionite 
(5 mM each) to ensure that they are not de- 
pleted during turnover while minimizing back- 
ground electron scattering. Turnover samples 
were prepared anaerobically under a Nz atmo- 
sphere in an anaerobic glovebox and flash 
frozen in liquid N, within 15 s after initiation 
of turnover. For purposes of rapid manual 
freezing of cryo-EM grids, these turnover sam- 
ples were removed from the glovebox, thawed, 
applied to grids, blotted, and frozen within 
<15 s. The ~30-s total preparation period was 
sufficiently long to ensure that steady-state 
catalytic conditions were reached but short 
enough such that there was still remaining 
reductant and MgATP in solution (see supple- 
mentary methods). We collected a large cryo-EM 
dataset (>15,000 movies) that yielded >4.5 mil- 
lion usable particles (figs. S1 and S2 and table 
S1). Through exhaustive two-dimensional (2D) 
and 3D classification and refinement, we iso- 
lated free MoFeP (~60%) and FeP particles 
(~5%), as well as FeP-MoFeP nitrogenase as- 
semblies (~35%), from this heterogeneous mix- 
ture. We determined the structures of two 
conformationally distinct 1:1 FeP:MoFeP com- 
plexes under turnover conditions, designated 
Y/eComplex-1 and “°Complex-2, at ~2.3-A reso- 
lution (Fig. 1C). Both complexes contain intact 
FeP and reduced, all-ferrous P-clusters (vide 
infra), indicating the lack of potential oxida- 
tion or damage by O, to the clusters. As a 
reference, we also obtained a ~1.8-A resolu- 
tion cryo-EM structure of resting-state MoFeP, 
termed “MoFeP, using the same turnover con- 
ditions but in the absence of FeP (Fig. IC, fig. 
83, and table S2). 

Given the C, symmetry of MoFeP and the 
large separation (>65 A) between the nearest 
clusters from the symmetry-related a8 sub- 
units, it has long been assumed that the two 
af halves of MoFeP function independently 
from one another (2, 8). In support of this 
assumption, the crystal structures of various 
FeP-MoFeP complexes in different nucleo- 
tide-bound states largely possess a 2:1 FeP: 
MoFeP stoichiometry (Fig. 1B) (J0, 11, 17). Un- 
expectedly, our turnover samples did not con- 
tain any particles that could be assigned to a 
2:1 FeP:MoFeP complex (Fig. 1C and fig. S1). 
We considered the possibility that the exclu- 
sive observation of 1:1 complexes in our turn- 
over samples could arise from an experimental 
artifact such as protein degradation or an in- 
crease in ionic strength during cryo-EM grid 
preparation. Therefore, we prepared a second 
set of turnover samples in the same manner as 
above that also included 5 mM beryllium fluo- 


ride (BeF,,), which is known to arrest ATP 
hydrolysis in a transition-like state to yield 
quasi-irreversible, solution-stable 2:1 and 1:1 
FeP-MoFeP complexes (18). Accordingly, our 
cryo-EM samples contained a large fraction 
of 2:1 FeP:MoFeP complexes alongside 1:1 spe- 
cies but no detectable free MoFeP particles 
(figs. S4 and S5 and table $3). We determined 
the structure of the MgADP.BeF,,-bound 2:1 
FeP:MoFeP at ~2.4-A resolution and found 
it to be isostructural to the crystal structure 
of the related MgADP.AIF,-bound 2:1 FeP: 
MoFeP complex [Protein Data Bank (PDB) ID 
1M34; Fig. 1C and fig. $5]. These observations 
affirm that our cryo-EM samples contain in- 
tact proteins and operate under native turn- 
over conditions, in turn indicating that the 1:1 
FeP:MoFeP stoichiometry is the predominant 
nitrogenase assembly state during catalysis. 

Prior work using pre-steady-state kinetics 
measurements revealed that the extents of in- 
terprotein ET and ATP hydrolysis were ap- 
proximately half of what would be expected 
if there were two independent FeP binding 
sites on MoFeP (J9). Originally, such half- 
reactivity was attributed to either partial inac- 
tivity of FeP molecules (19) or to the possible 
existence of an alternative interaction mode 
between FeP and MoFeP (20). Recent studies 
favored a model of negative cooperativity within 
a 2:1 FeP:MoFeP complex, whereby one of the 
bound FeP molecules suppresses ATP hydrol- 
ysis by the other bound FeP and the redox 
activity of the opposite af half of MoFeP 
(21, 22). Our cryo-EM structures instead sug- 
gest that half-reactivity and negative coop- 
erativity in nitrogenase arise from MoFeP 
binding to only one FeP molecule at a time 
during turnover. 

“Yecomplex-1 and “°Complex-2 were distin- 
guished during cryo-EM data processing pri- 
marily on the basis of the structural variability 
of the FeP components. Thus, we first exam- 
ined whether these differences are associated 
with the ATP-hydrolysis state of the two com- 
plexes. Earlier crystal structures identified at 
least three nucleotide state-dependent FeP- 
MoFeP docking geometries (DG1 to DG3) and 
led to the hypothesis that FeP moves in a uni- 
directional fashion across the MoFeP surface 
during turnover (Fig. 2A) (6, 17). The DG1 state 
predominates in the absence of nucleotides 
but is also populated in the presence of ATP 
(23) and corresponds to an electrostatically 
guided encounter complex wherein FeP largely 
interacts with the B subunit of MoFeP. DG2 
is the activated nitrogenase complex in which 
ATP hydrolysis is coupled to interprotein ET, 
with FeP occupying the quasi-symmetric sur- 
face of MoFeP shared between a and £ sub- 
units (Figs. 1B and 2A) (10, 11). Finally, DG3 is 
formed by adenosine diphosphate (ADP)-bound 
FeP and primarily uses the a-subunit surface 
of MoFeP (/1). The cryo-EM analysis of our 
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Fig. 2. Structural details of nitrogenase complexes characterized under turnover conditions. (A) The 
FeP:MoFeP docking geometry (DG) in “°Complex-1 (maroon) and “°Complex-2 (blue) observed in the 

DG2 configuration (black outline) compared with the DG1 (gold; PDB ID 2AFH) and DG3 (purple; PDB ID 2AF1) 
configurations characterized by x-ray crystallography (11). (B) Structural overlay of “°Complex-1 and 
“eComplex-2, indicating that the only large-scale conformational changes are observed in FeP. “Proximal” 
and “distal” refer to the og! and a7* halves of MoFeP bound and not bound to FeP, respectively. The alll 
domains of MoFeP are outlined in gray. (C) Structural overlay of the FeP components in “°Complex-1 and 
“eComplex-2, highlighting the nucleotide-dependent conformational differences [hinging (left) and twisting 
(right)] motions of the y' and y* subunits relative to one another during ATP hydrolysis. The axes of the y100's 
helices (residues y98 to y112) that radiate from the [4Fe:4S] cluster are shown as black lines to illustrate 
these conformational differences. (D and E) Residues in the vicinity of FeMoco [(D) aHis?”*, aPhe?°, and 
aHis*®". (E) aTrp*°3] that have undergone conformational changes in the distal subunit (a2) of MoFeP during 
turnover. Movements between the resting-state (“MoFeP) (gray) and the turnover structures (maroon and 
blue) are indicated with arrows. F, Phe; H, His; S, Ser; W, Trp. 


turnover samples revealed that u °Complex-1 | relative to DG1 and DG3. Characteristic of a 


and “ °Complex-2 were exclusively in DG2 (Fig. 
2A), implying that this configuration has a 
higher stability and/or longer residence time 
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DG2 configuration, both complexes feature 
extensive interactions between FeP and MoFeP 
(buried surface areas >3600 A’) and a short 


[4Fe:4S]-to-P-cluster edge-to-edge distance of 
~15 A, primed for rapid interprotein ET (Fig. 
2B). In “°Complex-1, both FeP y subunits (y 
and y’) are occupied by ATP molecules with 
clear densities for the y-phosphate groups and 
associated Mg** ions (Fig. 1C). By contrast, the 
y' subunit of “°Complex-2 features an ATP mo- 
lecule with weak density for the y-phosphate, 
whereas the y” subunit is ADP-bound and the 
y-phosphate is completely absent from the 
nucleotide binding pocket, indicative of asym- 
metry in ATP hydrolysis (Fig. 1C). This obser- 
vation is consistent with the crystal structure 
of a mixed-nucleotide FeP-MoFeP complex, in 
which AMPPCP (a nonhydrolyzable ATP ana- 
log) and ADP were selectively bound to the 
y and va subunits, respectively (24). The dif- 
ferences in the nucleotide occupancies of 
Y°Complex-1 and “°Complex-2 are reflected 
in their distinct FeP conformations (Fig. 2C), 
which is further corroborated by a principal 
components analysis of available FeP struc- 
tures (fig. S6). Collectively, we infer that (i) 
our cryo-EM structures report on active turn- 
over states from the standpoint of ATP hydro- 
lysis by FeP, (ii) the hydrolysis of two ATP 
molecules in each FeP cycle occurs in a 
stepwise fashion, and (iii) i °Complex-1 and 
u °Complex-2 correspond, respectively, to pre- 
and mid-ATP hydrolysis states of the nitroge- 
nase complex that are populated along the 
catalytic reaction coordinate. 

The asymmetry present both in ATP hydrol- 
ysis and FeP-MoFeP interactions has impor- 
tant implications for the timing of ET events 
during catalysis. Compared with a concerted 
process, stepwise nucleotide hydrolysis by FeP 
would be expected to increase the lifetime of 
the activated DG2 complex and provide addi- 
tional conformational states for orchestrating 
the multistep redox reactions that occur at 
FeMoco and the P-cluster (6, 24). Similarly, 
an alternating docking mechanism between 
FeP and MoFeP (as imposed by negative co- 
operativity between the of halves) would ef- 
fectively slow down successive ET steps to a 
given of subunit, providing sufficient time 
for anticipated protein and metallocluster 
rearrangements during N, reduction. This 
is consistent with the suggestion of Thor- 
neley and Lowe that the slow kinetics of 
nitrogenase [turnover rate (Kiumover) * 187] 
governed by FeP-MoFeP interactions may 
be a mechanistic imperative to favor No fixa- 
tion over the competing but less demanding 
H* reduction (8). 

Having analyzed “°Complex-1 and “°Complex-2 
in the context of the FeP cycle, we next in- 
vestigated whether they exhibited any confor- 
mational changes in their MoFeP components 
that may be correlated with the ATP hydrolysis 
state. In both “°Complex-1 and “°Complex-2, 
the P-clusters are in their fully reduced, all- 
ferrous (PY) forms (fig. $7), and the protein 
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backbone arrangements of MoFeP’s in both 
complexes are essentially indistinguishable 
from one another as well as from those in 
*SMoFeP and previously determined MoFeP 
crystal structures (Co. root mean square devia- 
tion of 0.2 A; Fig. 2B, fig. S8, and table $4). We 
observed no large-scale conformational changes 
that could account for mechanical coupling 
and negative cooperativity between symmetry- 
related FeP docking surfaces on MoFeP, impli- 
cating the involvement of a dynamic allosteric 
mechanism (e.g., FeP-induced changes in 
MoFeP conformational entropy) (fig. S9) (25). 
Yet, a detailed inspection of the structures 
revealed that the FeP-free (i.e., “distal”) «8? 
half in both “°Complex-1 and “°Complex-2 
possessed several features that distinguish it 
from the FeP-bound (i.e., “proximal”) a8" half 
and "“MoFeP. 

First, there are several conserved residues 
(o? Trp’, 0? His”, «?Phe®, o?His*") (26) in 
the vicinity of the distal FeMoco that adopt 
non-resting state conformations (Fig. 2, D 
and E). aT rp” is particular in its cis peptide 
bond to oSer*™ and its position in a proposed 
substrate access channel from the protein sur- 
face to FeMoco (27, 28). The observed confor- 
mational flip in o? Trp” leads to the diversion 
of this putative channel to an alternate face 
of FeMoco (Fig. 2E and fig. S10). «?His?”, 
a’Phe®”°, and o?His**! side chains appear to 
have undergone a concerted motion com- 
pared with their resting state (Fig. 2D), where- 
by o?His?™ and o?Phe®”° assume a similar 
configuration as that seen in the low-pH crys- 
tal structure of MoFeP (fig. S11) (29). This 
aHis? configuration was proposed to form 
a water-bridged H bond to a protonated belt 
sulfur (S5A) of FeMoco (29). Along these lines, 
the observed rearrangement of the «His? 
side chain in the turnover complexes could be 
envisioned to stabilize a protonated FeMoco 
intermediate and/or to increase the reduction 
potential of the cofactor, thus promoting its 
reduction by the P-cluster. 

Second, the cryo-EM densities surrounding 
the o?His“? and homocitrate ligands to the 
Mo center of the distal FeMoco are consider- 
ably less well defined compared with their 
counterparts in the proximal of half and the 
residues in the vicinity (fig. S12) and cannot be 
modeled with the resting-state configura- 
tions of these ligands (Fig. 3). The reduction 
in map density is particularly pronounced for 
us °Complex-2 (i.e, mid-ATP hydrolysis) com- 
pared with that of uv °Complex-l (i.e., pre-ATP 
hydrolysis). These observations suggest that 
a’ His**? and homocitrate are mobile during 
turnover and that Mo undergoes changes in 
innersphere coordination in a way that is cor- 
related with the nucleotide hydrolysis state of 
FeP bound to the opposing af half of MoFeP. 
The substitution of Mo with V or Fe in alterna- 
tive nitrogenases, the replacement of homoci- 
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trate with citrate, and alterations in H bonding 
to homocitrate have been shown to substan- 
tially diminish N, reduction activity and alter 
substrate specificity (2, 30-32). Indeed, the 
direct involvement of the Mo center in No 
reduction has been proposed early on (2, 33), 
although recent experimental findings have 
shifted the focus to the central Fe centers of 
FeMoco as being the primary sites for sub- 
strate activation (15, 16). Our cryo-EM observa- 
tions provide evidence that the Mo-homocitrate 
moiety may also participate in dynamic struc- 
tural transformations that accompany cataly- 
sis at FeMoco. 

Third, portions of a large domain in the dis- 
tal of subunit comprising residues 025 to 048 
and 0378 to 0403 (particularly in “°Complex-2) 
possess increased mobility compared with the 
rest of the MoFeP (fig. S13). This so-called olII 
domain forms a lid above FeMoco and in- 


cludes residues oGlu®®° and oPhe**!, which 
form close contacts with FeMoco (Figs. 2B and 
3). In resting-state MoFeP, alll is well ordered, 
aGlu®®° forms water-bridged H bonds to the 
Mo-ligated oHis**? side chain and homoci- 
trate, and oPhe** is in van der Waals contact 
with the labile belt sulfide S2B of FeMoco (Fig. 
3A). By contrast, in the distal af halves of 
Y°Complex-1 and “°Complex-2, the cryo-EM 
densities for oIII and the side chains of «?Glu*®° 
and o?Phe** are either diffuse or entirely mis- 
sing (Fig. 3B and fig. S13). This movement is 
likely coupled to the dynamics of the Mo 
ligands and FeMoco as a whole. alIII has also 
been shown to undergo major structural rear- 
rangements associated with the insertion of 
FeMoco into MoFeP (34). Furthermore, alII 
displays some of the highest temperature 
factors in MoFeP crystal structures and is po- 
sitioned away from lattice contacts (table S5) 
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Fig. 3. Changes in the FeMoco environment observed during catalytic turnover. (A and B) Views of 
FeMoco and the nearby residues oGlu°®° and aPhe** in the proximal (A) and distal (B) aB halves of MoFeP 
in MoFeP (gray), “°Complex-1 (maroon), and “°Complex-2 (blue) structures. Cryo-EM maps for each 
individual structure are shown as a gray mesh and contoured at the following levels: “MoFeP, 0.008; 


“eComplex-1, 0.065; and “°Complex-2, 0.075. E, Glu. 
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(10, 11, 24, 29, 35), implying that it is inherent- 
ly more flexible than other parts of MoFeP. 
Combined with this observation, u °Complex-1 
and “ °Complex-2 structures point to a possible 
role of aIII mobility in nitrogenase catalysis. 
Notably, oIII abuts the docking surface of 
MoFeP for MgADP-bound FeP in DG3 and 
may lie in the trajectory of FeP moving direc- 
tionally across the MoFeP surface during ATP 
hydrolysis (17). Thus, a dynamic olIII domain 
could also provide a direct mechanical con- 
duit between FeP and the proximal FeMoco, 
further linking the timing of nucleotide- 
dependent FeP-MoFeP interactions to redox 
transformations at FeMoco. The increased 
mobility of the oIII domain and the lack of a 
capping FeP could also allow more efficient 
substrate or molecular access to or exit from 
FeMoco. 

Based on the available structures, it is not 
obvious (i) how FeP docking induces confor- 
mational changes in the distal a subunit of 
MoFeP over a distance of >80 A or (ii) to which 
states of the MoFeP cycle (i.e., Ey to Eg) the 
observed nitrogenase complexes correspond. 
It is safe to assume, however, that all copies 
of FeMoco in these complexes represent the 
more stable or longer-lived of the many cat- 
alytic intermediates present in the turnover 
solution. The density maps for the proximal 
FeMoco’s are essentially identical to those for 
the resting-state cofactor (Fig. 3). Thus, an as- 
signment of E, for the proximal cofactors is 
plausible, although E, to E, are also reason- 
able because these states have been proposed 
as hydride-bound FeMoco intermediates (5, 16), 
which may be structurally indistinguishable 
from Ep at the current resolution of these 
cryo-EM maps. At the same time, the obvious 
deviations between the densities of distal and 
proximal FeMoco’s indicate that the distal co- 
factors in “°Complex-1 and “°Complex-2 might 
represent an E, state or a mixture of E, states 
that are different from the proximal cofactors 
and previous crystal structures solved in the 
resting state of FeMoco. It is thus possible that 
this structure represents a more advanced 
state in the MoFeP cycle (i.e., =E,) than the 
proximal FeMoco’s and involves the partic- 
ipation of the Mo center. We also note that the 
overall map density for the distal FeMoco in 
u °Complex-2 is less well defined than in those 
in ™MoFeP, “°Complex-l, or the proximal sub- 
unit in “ °Complex-2, which may further indi- 
cate conformational changes in the cofactor 
during turnover such as the displacement of 
belt sulfurs or ligand binding (12-74). Given 
the asymmetry between the FeMoco’s in the 
two af halves, it is tempting to propose a 
“ping-pong”-type mechanism in which the co- 
factors proceed through each of the eight 
catalytic steps in an alternating fashion. This 
scenario would assign a dual role to FeP: (i) to 
deliver an electron to one af subunit of MoFeP 
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and (ii) to suppress FeP binding to the oppo- 
site oB subunit while priming it for catalytic 
transformations through long-distance activa- 
tion of electron, H*, and/or substrate access 
pathways to the distal FeMoco. Establishing 
whether such a mechanism is operative will 
require future studies, but our current work 
illustrates that it is possible to characterize 
nitrogenase under turnover conditions by cryo- 
EM at near-atomic resolution, representing 
a critical step toward understanding the mech- 
anism of this enigmatic enzyme in full struc- 
tural detail. 
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NANOMATERIALS 


Nanocrystals with metastable high-pressure phases 


under ambient conditions 


Tianyuan Xiao’, Yasutaka Nagaoka’}, Xirui Wang't, Tian Jiang'+, Derek LaMontagne’, Qiang Zhang’, 
Can Cao’§, Xizheng Diao’, Jiahua Qiu’, Yiruo Lu’, Zhongwu Wang”, Y. Charles Cao!“ 


The ambient metastability of the rock-salt phase in well-defined model systems comprising nanospheres or 
nanorods of cadmium selenide, cadmium sulfide, or both was investigated as a function of composition, initial 
crystal phase, particle structure, shape, surface functionalization, and ordering level of their assemblies. Our 
experiments show that these nanocrystal systems exhibit ligand-tailorable reversibility in the rock salt-to— 
zinc blende solid-phase transformation. Interparticle sintering was used to engineer kinetic barriers in the 
phase transformation to produce ambient-pressure metastable rock-salt structures in a controllable manner. 
Interconnected nanocrystal networks were identified as an essential structure that hosted metastable high- 
energy phases at ambient conditions. These findings suggest general rules for transformation-barrier 
engineering that are useful in the rational design of next-generation materials. 


olids from a collection of atoms can 
adopt a variety of structural phases with 
their respective physical and chemical 
properties, providing the foundation 
for materials discovery (1-3). At ambient 
temperature and pressure, there is often one 
thermodynamically stable phase for a given 
atomic collection, and the rest can potentially 
become metastable as kinetically trapped 
phases with positive free energy above the 
equilibrium state (4). However, a general 
strategy for engineering kinetic barriers has 


de1.0 GPa 


15.0 GPa 


8.2 GPa 


Ajisusjzu| 


7.2 GPa 


Log(Intensity) 


6.5 GPa 


6.0 GPa 


2.6 GPa 


0.0 GPa 


1234 15 20 25 30 35 40 


Q (nm") 


Fig. 1. Ligand-tailorable reversibility of the RS-ZB solid-phase 
transformation in superlattices of 4.8-nm WZ CdSe nanocrystals. (A to 

C) WAXS and SAXS patterns collected during compression and decompression 
(de) at different pressures. CdSe NCs were functionalized with octylamine of 
(A) 92% surface coverage and of (B) 62% surface coverage and (C) capped with 
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yet to be developed but is essential for the 
rational synthesis of new materials and for 
expanding the space of synthesizable meta- 
stable materials (5, 6). 

Phase transformations in bulk solids exhibit 
complex kinetics involving different micro- 
scopic pathways that occur in parallel at dif- 
ferent locations within one crystal domain, 
which are thus difficult to determine exper- 
imentally (7). However, transition pressures 
measured in experiments match well with the 
theoretical values determined from electronic 
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structure calculations. For example, the ther- 
modynamic transition in silicon from a dia- 
mond to a B-tin structure was calculated to be 
8.0 GPa, and this transformation was observed 
experimentally in the range of 8.8 to 12.5 GPa 
(8). However, based on elastic stability analysis, 
Mizushima, Yip, and Kaxiras (MYK) theoret- 
ically predicted that defect- and strain-free bulk 
silicon can remain metastable in the diamond 
structure up to 64 GPa, which implies a huge 
intrinsic activation barrier (~0.3 eV per atom) 
in the structural transformation (9). 

This discrepancy indicates that the predicted 
intrinsic energy barrier in an ideal crystal is 
drastically decreased by mechanisms asso- 
ciated with defects in real bulk solids during 
high-pressure experiments and is reflected 
in high-pressure phases of most bulk solids 
relaxing back to the ambient phase upon re- 
lease of pressure. Importantly, the MYK cal- 
culation predicts that most high-energy solid 
phases are theoretically metastable at ambient 
conditions, if their hosting crystals are defect- 
and strain-free (8, 9). This insight allows for 
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a mixture of octylamine and CTAB (with a molar ratio of 5:1) of 63% surface 
coverage in terms of hydrocarbon chains. SAXS patterns are shown in green. WAXS 
diffraction patterns from the WZ/ZB structure and from the RS structure are 
shown in blue and red, respectively. See supplementary materials for details on 
composition-ratio determinations. Q, scattering vector, calculated as 4x sin(26)/A. 
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Fig. 2. Ligand-tailorable ambient metastability of the RS phase in 4.8-nm nanospheres. (A to C) WAXS 
patterns of NC assemblies decompressed from a pressure greater than 10.4 GPa: (A) ZB CdS NCs, (B) 
WZ CdSe/CdS core/shell NCs, and (C) WZ CdS/CdSe core/shell NCs. The type of ligand and surface 
coverage are given on the corresponding curves for octylamine (OcAm) or oleylamine (OAm). Pyridine ligand 
is of ~100% surface coverage, and ODT ligand is in excess amount corresponding to ~150% surface coverage. 
The corresponding WAXS and SAXS patterns obtained at different pressures can be found in figs. S12 to S14 
and S16. See supplementary materials for details on ligand composition-ratio determinations. 


the development of a general approach for 
making ambient metastable materials of given 
chemical compositions based on further under- 
standing of kinetic pathways in solid-solid 
phase transformations (8). 

High-quality colloidal CdSe semiconductor 
nanocrystals (NCs) (10) have been extensively 
used as models to study the phase transition 
from a four-coordinate hexagonal wurtzite 
(WZ) to a six-coordinate cubic rock-salt (RS) 
structure (6, 11-16). Bulk CdSe undergoes a 
WZ-to-RS phase transition at ~2.5 GPa, whereas 
defect-free WZ CdSe NCs can be metastable at 
pressures above 6.0 GPa (6, 11), which provides 
solid evidence for reliability of the MYK cal- 
culation (9). Because these NCs can act as 
single structural domains under high pres- 
sures, their phase-transition kinetics are sim- 
pler than those of bulk solids and are highly 
reproducible (72). Such advantages allowed for 
experimental measurements of fundamental 
properties—such as the energy, volume, and 
entropy of activation—of the WZ-to-RS tran- 
sition in CdSe NCs that revealed direction- 
dependent nucleation through a sliding-plane 
mechanism (13, 14). However, detailed kinetic 
pathways for the reverse phase transforma- 
tion from the six-coordinate RS to the four- 
coordinate cubic zinc-blende (ZB) structure 
remain unclear (14-17). 

There exists evidence that under some con- 
ditions, high-pressure phases can be retained 
in NCs upon release of pressure. At ambient 
conditions, a sample of 11-nm CdSe NCs was 
composed of ~20% metastable RS structure 
(15), and a sample of 13-nm PbS nanocubes was 
composed of 37.2% metastable high-pressure 
phase (8). Additionally, the RS phase of CdSe/ 
ZnS and CdS NCs could fully persist in a 
metastable state at ambient pressure (19-23). 
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These observations suggest the existence of 
unknown mechanisms for eliminating crystal 
defects from high-pressure structures (8, 9). To 
test this hypothesis, we performed a mecha- 
nistic study on the reversibility of the RS-ZB 
transformation in systems comprising spherical 
NCs or nanorods of CdSe, CdS, or both. We ob- 
tained insight into the mechanisms and micro- 
scopic processes that determine the kinetic 
barrier height between two crystal phases. 
The ambient metastability of the RS phase 
in NCs of CdSe, CdS, or both was studied as a 
function of composition, initial crystal phase, 
particle structure, shape, surface functional- 
ization, and ordering level of their assemblies. 
We synthesized six types of spherical NCs with 
4.8-nm diameter and one type of CdSe/CdS 
nanorod with 23.8-nm length and 4.8-nm di- 
ameter that exhibited a narrow size distribu- 
tion and high crystallinity, as confirmed with 
transmission electron microscopy (TEM), 
ultraviolet-visible absorption, and fluores- 
cence spectroscopy (figs. S1 to S5). The NC 
surfaces were precisely controlled through 
ligand exchange with a designed surface den- 
sity. We arbitrarily assigned a surface density 
of five ligands per nm” as 100% ligand cover- 
age. Assemblies of these NCs were prepared 
either directly on the flat surface of a diamond 
anvil or at the air-liquid interface of diethylene 
glycol. To introduce deviatoric stress for promot- 
ing interparticle interactions, high-pressure 
experiments were performed in a diamond 
anvil cell without the use of a pressure medium 
(24-27). The solid-state phase structures as a 
function of pressure were determined with 
simultaneous measurements of both small- 
angle and wide-angle x-ray scattering (SAXS 
and WAXS) at the Cornell High Energy Syn- 


chrotron Source. 


At ambient conditions, octylamine-capped 
WZ CdSe NCs (92% coverage; fig. S6A) formed 
superlattices in a face-centered-cubic (fcc) struc- 
ture with an interparticle distance of 6.76 nm 
(Fig. 1A). Upon increasing pressure, the super- 
lattice unit-cell size decreased gradually until 
7.2 GPa, above which point the superstructure 
distorted, whereas the atomic WZ unit-cell 
volume decreased smoothly up to a threshold 
pressure (~6.0 GPa), at which point there was 
an abrupt decrease in unit-cell volume caused 
by the phase transformation to the RS structure 
(Fig. 1A). Upon decompression, the distorted 
fcc superlattice structure recovered in large 
part, whereas the atomic lattice system com- 
pletely transformed to a four-coordinate ZB 
structure (Fig. 1A). Solid-phase transition of 
the atomic lattice exhibited a hysteresis of 
~6.0 GPa, which was close to those measured 
under a pressure-media environment (6, 1/4). 
By contrast, in the sample of CdSe NCs capped 
with 93% butylamine coverage, a 4.8% RS 
phase was preserved back to ambient pressure 
(fig. S7A). These results suggested that ligand- 
shell thickness would play a major role in 
tailoring the reversibility of the RS-ZB phase 
transition in CdSe NC systems. 

We performed more than 50 synchrotron- 
based high-pressure experiments to investigate 
the ambient metastability of RS CdSe structures 
as functions of amine ligand coverage and alkyl- 
chain length, the extent of long-range order in 
NC assemblies, applied pressure (up to 22 GPa), 
and decompression rate. At optimized condi- 
tions, only 50 to 60% of the RS structures were 
preserved in decompressed CdSe NC samples 
(Fig. 1B and fig. S7B). This result indicated that 
the surface binding of amine ligands may have 
been too strong and minimized effective inter- 
actions between neighboring NCs. However, 
when pyridine, a weakly binding ligand, was used, 
the NC assemblies exhibited nearly no long-range 
order, and no RS structure was retained in the 
NC system decompressed from 15.0 GPa, whereas 
after decompression from 16.2 GPa, 3.9% of RS 
structures were retained (fig. S8). These results 
suggest that the order of NC assemblies should 
play a role in the preservation of RS CdSe 
structures at ambient conditions. 

To weaken the average ligand-binding 
strength while retaining the ability of the 
NCs to form ordered superstructures, we used 
a dual-ligand approach with a mixture of a 
primary amine (as the stronger binding ligand) 
and cetyltrimethylammonium bromide (CTAB) 
(as the weaker binding ligand; fig. S6B). The 
assembly of CdSe NCs functionalized with 
octylamine and CTAB (5:1 with a coverage of 
63%; fig. S6A) formed an fcc superlattice with 
diy-spacing of 5.03 nm (Fig. 1C). The RS-ZB 
transition became completely irreversible in 
these assemblies, and the RS CdSe structure 
was fully preserved at ambient conditions (Fig. 
1C). The lattice constant of the fec superstructure 
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Fig. 3. Superlattice structures of CdSe/CdS WZ nanorods and RS nanorods. (A to D) Superlattices of 
WZ nanorods before compression: (A) TEM image, (B) small-angle electron diffraction (SAED) pattern at 

a 0°-tilt condition, and the wide-angle electron diffraction (WAED) pattern at a (C) 0°-tilt condition with [2110] 
zone axis and at a (D) 30°-tilt condition with [1010] zone axis. (E) 3D atomic nanorod model viewed along 
the zone axis of corresponding ED patterns shown in (C) and (D), respectively. (F to 1) Superlattices of RS 


nanorods: (F) TEM image, (G) SAED pattern at a 0°-tilt condition, and the WAED pattern at a (H) O°-tilt 
condition and at a (I) 45°-tilt condition, superimposed with simulated diffraction spots from the three 


corresponding RS domains (see figs. S18 and S19 for details). (J) 3D atomic model of a double-bend nanorod 


WwW 


th three RS domains (a, B, y) viewed at angles corresponding to those for ED patterns in (H) and (I), 


W 


th labeled zone axes, where u = (v3 1) and v = (v3 


2) at an off-axis direction (see fig. S20 for 


details). (K) Reconstructed 3D superlattice model showing each RS nanorod atomically overlapping with its 
four neighboring rods at corresponding rectangular areas on their 8 domains (fig. S22). The RS structures 
were extremely sensitive to electron beam irradiation and limited our ability to use conventional high- 
resolution TEM to study detailed features in RS nanostructures (fig. S26). 


gradually decreased with a pressure increase 
up to 7.5 GPa. Above this pressure, the super- 
structure started to distort (indicated by a con- 
tinuous expansion of its d-spacing) and then 
irreversibly transformed into a lamellar struc- 
ture at pressures above ~12.0 GPa (fig. S9). 
Upon decompression to ambient pressure, the 
lamellar structure retained its integrity, and 
its d-spacing further increased to 5.81 nm (Fig. 
1C and fig. S9). 

Our TEM observations showed that the RS 
CdSe NCs existed only in the form of aggregates 
(fig. S10C). After sonication, they could be 
redispersed into 4.8-nm colloidal particles in 
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chloroform, but their crystal phase transformed 
into the ZB structure (fig. S10D). Altogether, 
these results suggest that strong interparticle 
associations (such as sintering) may have oc- 
curred within the lamellar structures that could 
be associated with the ambient metastability of 
the RS CdSe structure. Similar ligand-tailorable 
phenomena were observed in CdSe NCs capped 
with butylamine and CTAB ligands (fig. S7C). 
The formation energy of ZB CdSe is 1.0 meV 
per atom lower than that of the WZ form, 
and such NCs were enclosed by crystal facets 
different from those of WZ (28). However, no 
substantial differences were observed between 


4.8-nm ZB and WZ CdSe NCs in both the 
pressure-induced transformation to the RS struc- 
ture and the ligand-tailorable reversibility of the 
RS-ZB phase transitions (fig. S11). Pure ambient 
metastable RS CdSe NCs can also be synthe- 
sized from ZB NCs (fig. S11B), showing that the 
initial crystal phase was not important for the 
ambient metastability of the resulting RS struc- 
tures, and RS NCs that were transformed from 
the ZB and WZ phases could have degenerate 
three-dimensional (3D) atomic fine structures (29). 

Bulk CdS, like CdSe, undergoes a reversible 
solid-phase transformation from a four- to six- 
coordinate structure at a pressure of ~2.5 GPa 
(30). However, CdS NCs exhibited substantial 
differences in the ligand-tailorable reversibility 
of their RS-ZB phase transition when compared 
with their CdSe counterparts (Fig. 2A). Dual- 
ligand functionalization was not necessary for 
making ambient metastable RS CdS NCs. Upon 
decompression from a pressure above 10.4 GPa, 
the RS phase was fully preserved in assemblies 
made from 4.8-nm ZB CdS NCs capped with 
amine ligand coverage as high as 95%, or even 
capped with 100% pyridine ligands, in which 
case NC assemblies exhibited no long-range 
order (Fig. 2A and fig. S12, A to C). By contrast, 
in assemblies of CdS NCs functionalized with 
excess octadecanethiol (ODT) (a stronger bind- 
ing ligand than amines, for isolating CdS NCs 
as separate particles at high pressures), the RS-ZB 
phase transformation was completely reversible 
even when decompressed from 15.2 GPa (Fig. 2A 
and fig. S12D). These results demonstrated that 
the RS CdS phase was not intrinsically metastable 
at the nanometer scale, suggesting that the 
observed metastability in CdS NCs should also 
be related to strong interparticle associations, 
as in the case of RS CdSe NCs. 

To further explore effects induced by chem- 
ical composition and particle structure, we 
synthesized 4.8-nm CdSe/CdS core/shell NCs 
in a WZ or ZB structure from corresponding 
3.4-nm CdSe cores, and 4.8-nm WZ CdS/CdSe 
core/shell NCs from 3.4-nm CdS cores (figs. 
S3 and S4). In terms of the ligand-tailorable 
reversibility of the RS-ZB phase transition, 
both WZ and ZB CdSe/CdS NCs exhibited a 
nearly identical property to CdS NCs (Fig. 2B 
and figs. S13 to S15), whereas the behavior of 
CdS/CdSe NCs was in between that of CdS and 
CdSe NCs (Fig. 2C and fig. S16). Together, 
these results demonstrate that the compo- 
sition of shell and core both play important 
roles in maintaining the RS phase at ambient 
conditions. 

To study NC shape effects, we prepared 
atomically aligned nanorod superlattices at an 
air-liquid interface using 23.8-nm WZ CdSe/ 
CdS nanorods functionalized with 95% octyl- 
amine coverage. These superlattices could be 
indexed as a simple hexagonal structure with 
lattice parameters of a = 6.72 nm and c = 26.2 nm, 
where the atomic WZ lattice of the CdSe/CdS 
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nanorods was coaxially aligned with the super- 
lattice (fig. S17, A to C, and Fig. 3, A to E). After 
the pressure increase to 6.3 GPa, CdSe/CdS 
nanorods exhibited a WZ-RS phase transition 
at a pressure range slightly lower than that of 
the 4.8-nm CdSe/CdS core/shell nanospheres. 

When decompressed from a pressure greater 
than ~10.0 GPa, the RS phase was fully pre- 
served at ambient conditions (fig. S17A). During 
this process, the hexagonal superlattice struc- 
ture was largely retained but was accompanied 
by a substantial d-spacing decrease in the c-axis 
direction, adopting a new set of lattice param- 
eters of a = 6.97 nm and ¢ = 17.5 nm (Fig. 3, F 
and G, and fig. S17A). As indicated by the dot- 
like electron diffraction (ED) patterns, the three- 
dimensionally ordered atomic alignments of 
CdSe/CdS nanorods persisted in the RS phase in 
the superlattices (Fig. 3, H and I). Altogether, our 
experimental data suggest that in this process, 
WZ CdSe/CdS nanorods transformed into 
double-bend RS nanorods with three domains, 
exhibiting a height of ~15.8 nm and an angle of 
125.3° between neighboring domains (Fig. 3, F 
to J, and figs. S18 to S20). This shape change 
involved a shearing motion of the (0001) WZ 
crystal planes, which is consistent with the 
direction-dependent nucleation model of the 


Fig. 4. Electrical conductivity 
as indictor of interparticle 
sintering. (A and B) The 
current-voltage curve for films 
composed of assemblies of 
(A) 23.8-nm WZ CdSe/CdS 
nanorods and (B) 4.8-nm WZ 
CdSe/CdS core/shell nano- 
spheres in (1) the WZ phase 
before phase transition ob- 
tained by decompression from 
.8 GPa, (2) the ambient 
stable RS phase obtained 

by decompression from 

0.4 GPa, or (3) the ZB phase 
hat transformed from RS 

NCs with excess ODT after 
decompression from 

2.5 GPa. (C) Measured 
conductivity is shown in col- 
umn A for nanorods and col- 
umn B for nanospheres. (D and 
E) Schematic of proposed 
interparticle sintering mecha- 
nism for nanorods and nano- 
spheres, respectively: (1) initial 
nanostructures at ambient 
pressure, (2) the RS phase in 
he form of interconnected 

C networks at a high pressure 
(fig. S10, C, G, and K), and 

(3) the RS phase in the form of 
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WZ-to-RS transition proposed by Alivisatos 
and co-workers (12, 14, 16). 

The formation of three-domain nanorods is 
also consistent with TEM observations of the 
ZB nanorods recovered from RS aggregates 
(fig. S10, K and L). The recovered nanorods, 
having sizes similar to those of the nanorods 
before compression, exhibited traces of shape 
deformation and contained multiple crystalline 
domains and curved lattice fringes caused by 
strain (fig. $21). Based on all of the structural 
data, a reconstructed superlattice model showed 
that each double-bend nanorod atomically over- 
lapped with four neighbors at the correspond- 
ing rectangular areas on the surfaces of their B 
domains, suggesting the existence of interpar- 
ticle sintering, which would be responsible for 
the metastability of the RS phase (Fig. 3K and 
fig. S22). Indeed, in the assemblies of CdSe/ 
CdS nanorods functionalized with excess strong- 
binding ODT ligands, the RS-ZB phase transi- 
tion became fully reversible, and no detectable 
RS phase was observed when decompressed 
even from 12.5 GPa (figs. S10J and S17D). 

Additional information regarding interpar- 
ticle sintering was obtained from electrical 
conductivity measurements (37). Films, com- 
posed of either WZ structures recovered before 


Current (mA) 


0.0 0.4 0.8 -0.8 -0.4 


Voltage (V) 


the WZ-RS transition or ZB structures that 
transformed from the RS phase, were insulat- 
ing below the detection limit (<10 pS), whereas 
the films of RS structures exhibited conductiv- 
ities at least eight orders of magnitude higher, 
ranging from 3.0 to 8.0 mS (Fig. 4, A to C). 
These results, together with TEM observations, 
provide evidence that the ambient metastable RS 
phase existed in the form of three-dimensionally 
interconnected and partially fused NC networks 
created by interparticle sintering (Figs. 3 and 4 
and fig. S10). These findings also suggest that 
the observed ligand-tailorable reversibility of 
their RS-ZB phase transition was caused by 
ligand effects during interparticle sintering, 
in which surface-ligand rearrangement was 
the initial step for the surface-atom diffusion 
and chemical-bond formation (32). 

Based on these results, we propose that in- 
terparticle sintering was a main process that 
eliminated crystal defects and relaxed lattice 
distortions from high-pressure RS structures. 
The partial restoration of the intrinsic kinetic 
barrier in ideal crystals (as predicted by MYK) 
created ambient metastable nanostructures (Fig. 
4, D and E). The RS NCs that formed from 
pressure-induced solid-phase transitions com- 
prised a large quantity of high-energy crystal 


A(mS) B (mS) 


-0.0 0.4 08 
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“free” NCs isolated with strong-binding ligands at high pressures. In recovered samples after decompression, either no or low degrees 
of interparticle sintering were observed with TEM (fig. S10, B, F, and J). P, pressure. 
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defects and lattice distortions (12) and served 
as nucleation sites for the rapid RS-to-ZB and 
RS-to-WZ transitions observed in the systems 
with no or low degrees of interparticle sintering 
(fig. S10, B, F, and J) (6, 17-74). In high-pressure- 
and/or deviatoric-stress-induced interparticle 
sintering, solid-state chemical reactions—driven 
by the minimization of Gibbs free energy—take 
place between surface atoms, forming effective 
sinks to absorb local high-energy defects and 
lattice distortions (33). These reactions pro- 
duce grain and/or twin boundaries (GTBs) 
between neighboring particles (fig. S23, A and 
B) to form three-dimensionally interconnected 
nanocrystal networks in which crystal defects 
can be delocalized through stress-driven dif- 
fusion or propagation (34). This process al- 
lowed delocalized and collective GTB-defect 
interactions within an interconnected network 
where GTBs act as sources, sinks, or both to 
eliminate crystal defects through absorption 
and annihilation (note that GTBs also occurred 
within a single double-bend nanorod; fig. S23B) 
(35, 36). 

Moreover, GTBs further acted as obstacles 
to block or jam dislocation motions and stabilize 
and harden interconnected NC networks 
(37, 38) and provided additional mechanisms 
to raise the activation barrier height for the 
RS-to-ZB transformation (fig. S23, C and D). 
These defect-elimination and strain-relaxation 
mechanisms depended on the hydrostatic 
and deviatoric stress applied on the system 
(24, 25). Our experimental findings showed 
that the activation barrier height in resulting 
RS structures was strongly dependent on the 
highest applied pressure, indicated by the ob- 
served partial reversibility of the RS-ZB tran- 
sition in samples decompressed from lower 
pressures (figs. S13, S17, and S24). These results 
further demonstrate a critical pressure for a 
given NC system, above which point the re- 
sulting interconnected NC networks can fully 
retain the RS phase at ambient conditions 
(fig. S24). 

This interparticle sintering (or partial fusion) 
mechanism is in good agreement with the 
entire set of experimental observations as well 
as the MYK prediction (8, 9). Without pressure 
media, the formation of three-dimensionally 
interconnected NC networks strongly inter- 
plays with an emergent phenomenon called 
force chain networks (39), which are created 
by interparticle interactions and by the topo- 
logical patterns of pressure applied within the 
system (39, 40). Thus, the order of NC assem- 
blies affects the degree of interparticle sintering 
as well as the mechanical integrity and strain 
uniformity of the resulting interconnected net- 
works (Figs. 1C and 2C and fig. S8). In disordered 
systems, NC shape anisotropy promotes the 
formation of anisotropic force-chain-network 
architectures (40, 41), yielding strained-NC 
networks and thus imposing additional effects 
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on the ambient metastability of high-pressure 
phases. Indeed, when the ordering level of 
hexagonal-superlattice structures was decreased, 
CdSe/CdS nanorod assemblies retained 77.8% of 
the RS phase, whereas 100% was still preserved 
in totally disordered assemblies of CdSe/CdS 
nanospheres (figs. S14A and S25). In addition, 
the observed composition dependence in the 
ligand-tailorable reversibility of the RS-ZB 
phase transition likely originated from differ- 
ences in the chemical and mechanical proper- 
ties regarding interparticle sintering, as well 
as the formation, elimination, propagation, 
and annihilation of crystal defects and the 
relaxation of lattice distortions in the inter- 
connected NC systems (29, 32-41). 

To date, metastable RS CdSe, CdS, and 
CdSe/CdS samples in our lab have been fully 
preserved under ambient conditions for more 
than 6 years. This extended lifetime shows 
that an apparent activation-energy barrier of 
more than 1.3 eV per particle was created in 
the RS-to-ZB solid phase transformation (8, 20). 
The strategy for engineering transformation 
barriers established in this work is readily 
generalizable for making ambient metastable 
high-energy phases of other materials such as 
IV-VI and III-V semiconductors and transition- 
metal chalcogenides. In addition, our findings 
identify interconnected nanocrystal networks 
as a new form of matter that can effectively 
host metastable high-energy phases at ambient 
conditions. This approach allows for the ra- 
tional design of synthetic methods for meta- 
stable materials through low-pressure routes 
(42), opening new opportunities for materials 
discovery and fabrication (2-5). 
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Cryo-EM structure of the human IgM B cell receptor 


Qiang Su'?*+, Mengying Chen*}, Yan Shi?2++, Xiaofeng Zhang”, Gaoxingyu Huang*, 


Bangdong Huang", Dongwei Liu’, Zhangsuo Liu‘, Yigong Shi 


<1,2,3% 


The B cell receptor (BCR) initiates immune responses through antigen recognition. We report a 
3.3-angstrom cryo-electron microscopy structure of human immunoglobulin M (lgM)-BCR in the resting 
state. IgM-BCR comprises two heavy chains, two light chains, and the lga/lgB heterodimer. The 
ectodomains of the heavy chains closely stack against those of Iga/lgp, with one heavy chain locked 
between Iga and Ig in the juxtamembrane region. Extracellular interactions may determine isotype 
specificity of the BCR. The transmembrane helices of IgM-BCR form a four-helix bundle that appears to 
be conserved among all BCR isotypes. This structure contains 14 glycosylation sites on the IgM-BCR 
ectodomains and reveals three potential surface binding sites. Our work reveals the organizational 
principles of the BCR and may facilitate the design of antibody-based therapeutics. 


cells provide specific and long-lasting 

protection against a variety of patho- 

gens (J, 2). B cell activation, initiated 

by antigen binding to the B cell recep- 

tor (BCR), results in B cell proliferation 
and differentiation into either plasma cells 
or memory cells (3). The BCR consists of a 
membrane-bound immunoglobulin (mIg) and 
two signal-transducing subunits, Iga and Igf 
(4, 5). The five mlg isotypes (A, D, E, G, and 
M) each comprise two heavy chains and two 
light chains (6). Mature naive B cells express 
IgD and IgM, whereas memory B cells express 
IgA, IgG, or IgE (7). Antigen binding to the 
BCR triggers signal transduction by Igo and 
IgB (8), each of which consists of an Ig-like 
extracellular domain (ECD) and an intracellular 
immunoreceptor tyrosine-based activation 
motif (ITAM) (8). 

Upon antigen binding, the BCRs are clus- 
tered on the plasma membrane, leading to 
phosphorylation of the ITAMs (3). ITAM phos- 
phorylation triggers activation of downstream 
kinases exemplified by the spleen tyrosine 
kinase (SYK) and Bruton tyrosine kinase 
(BTK) (3). In addition, antigen-bound BCR 
complexes are internalized and processed, and 
antigenic peptides are presented to helper 
T cells (3). Activation of these kinases and 
participation of helper T cells drive the pro- 
liferation and differentiation of B cells (7, 9). 
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Several different models of BCR activation 
have been postulated, but the underlying mech- 
anisms remain an open question (8, 10, 11). 

The BCR pathway constitutes a key ther- 
apeutic target for B cell malignancies (12, 13). 
Small-molecule BTK inhibitors, such as ibrutinib 
and orelabrutinib, are used to treat several types 
of B cell lymphoma (14, 75). Polatuzumab 
vedotin, an Igf-targeting monoclonal antibody 
conjugated to an antimitotic agent, has been 
approved for the treatment of diffuse large 
B cell lymphoma (/6). Abnormal BCR signaling 
also causes immune disorders, such as type 1 
diabetes and systemic lupus erythematosus 
(17) and agammaglobulinemia (J8). 

Although the structures of sIgs and the 
ECD of IgB have been reported (19, 20), the 
architecture and organization of an intact 
BCR remain to be elucidated. Among all BCR 
isotypes, IgM-BCR is particularly important 
for B cell development and activation of naive 
B cells as well as for oncogenic BCR signaling 
in most B cell malignancies (27). VRCO1-IgM- 
BCR, with the variable domain replaced by 
that of the HIV neutralizing antibody VRC01, 
recognizes gp120 (22). Here, we report the 
cryo-electron microscopy (cryo-EM) struc- 
ture of VRCO1-IgM-BCR (referred to as IgM- 
BCR hereafter) in complex with a Fab fragment 
of polatuzumab. 

An intact IgM-BCR consists of six chains: 
two heavy chains (1), two light chains (1), Iga, 
and Ig. To facilitate purification, we fused the 
recombinant « chain and Iga/IgB to affinity 
tags (fig. S1A). To assist protein folding, we 
coexpressed IgM-BCR with pERp1, which 
promotes oxidative folding of IgM (23). In 
addition, HO, was used to preserve disulfide 
bonds in IgM-BCR. The purified IgM-BCR 
exhibited good solution behavior (fig. S1B). 
We also purified recombinant gp120 monomer 
and gp120 trimer (fig. SIC). As anticipated, 
IgM-BCR recognized both forms of gp120, with 
much greater avidities toward the trimer (fig. 
SID). IgM-BCR formed a stable complex with 
the Fab of polatuzumab (fig. SIE). 


The IgM-BCR complex was analyzed by 
cryo-EM (fig. S2A). We performed a seed- 
facilitated two-dimensional (2D) classification 
and heterorefinement using cryoSPARC (24), 
generating a reconstruction at 3.9 A resolu- 
tion (figs. S2B and S3). After 3D classification 
and refinement, the final reconstruction dis- 
played an average resolution of 3.3 A (fig. S4 
and table S1). The EM maps displayed fine 
features for side-chain assignment (figs. S5 
and $6), with local resolution reaching 2.8 A 
(Fig. 1A). We modeled three Ig-like domains 
(Cu2, Cu3, and Cu4) and a p transmembrane 
region (TMu) for each heavy chain (ua and 
Lp), aS well as the ECDs and TMs for Iga/IgB 
(Fig. 1B). The Fab domain (Vu-Cu1/V«-Cx) re- 
mained poorly resolved, likely reflecting its 
mobile position. In the low-pass-filtered EM 
map (Fig. 1C), two lobes of dumbbell-shaped 
density are connected to that of the Fc do- 
main and may accommodate the Fab domains. 

Another lobe of EM density is connected to 
that for IgB and likely comes from polatuzumab- 
Fab. This conjecture was confirmed by the EM 
reconstruction of the polatuzumab-free IgM- 
BCR (fig. S7, A to D), which closely resembles 
that of the polatuzumab-bound IgM-BCR but 
lacks the suspected density (fig. S7, E and F). 
Thus, polatuzumab binding likely causes little 
structural alteration, consistent with the obser- 
vation that polatuzumab recognizes the dis- 
ordered N terminus of Igf (25). 

Each heavy chain spatially consists of five 
Ig modules (Vu, Cul, Cu2, Cu3, and Cu4), a 
linker segment, Lu, and a transmembrane helix, 
TMu (Fig. 1D). Each light chain comprises two 
Ig modules, V« and Cx, which together with Vu 
and Cul assemble into a Fab domain. Within 
the Fe domain, Cu2, Cu3, and Cu4 from each 
heavy chain are twisted relative to each other 
(Fig. ID). Cu2, and Cu4, interact with Cu2p/ 
Cu3z and Cu4s, respectively. Furthermore, Cu3, 
is spatially separated from Cu3p, resulting in a 
large central hole in the Fc domain (Fig. 1, D 
and E). 

Cu2, associates symmetrically with Cu2,, 
mainly through van der Waals interactions, 
with two Cys®*’ residues forming an inter- 
chain disulfide bond (Fig. 1E). Cu2, also rec- 
ognizes the surface loops of Cu3p,. In particular, 
Are”! and Are? of Cu2, donate three hydro- 
gen bonds (H-bonds) to Asp?”°/Thr**!/Leu* of 
Cu3p, Whereas Leu” of Cu2, accepts an H-bond 
from Thr*”? of Cu3p. Similar to the interface 
between Cu2, and Cu2p, the interface between 
Cu4, and Cu4g is symmetric and dominated by 
van der Waals contacts (Fig. 1F). The residues at 
the interface between the two Cu chains are 
invariant among mammals but show some 
variation in birds and fish (fig. S8A). 

The Fc domain is connected to TMu through 
Lu. TMi, associates with TMuz asymmetrically 
in the membrane. A number of hydrophobic 
residues from TMuz, stack closely against the 
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Fig. 1. Structure of the human IgM- A 
BCR. (A) Cryo-EM reconstruction of the 


human IgM-BCR at an average resolution Resolution (A) N 
of 3.3 A. The local resolution is color- =. ‘ 


coded. The densities are contoured at 
10. (B) The structurally resolved IgM- 
BCR contains lga (gold), IgB (teal), and 
two heavy chains, wa, (gray) and pg (blue). 
Two views of the EM densities (contoured 
at 1lc) are shown. The coloring scheme 
used in this panel is maintained through- 
out. (C) The low-pass filtered (to 15 A) EM 
map of the human IgM-BCR shows the 
ocation of the Fab domains. Two Fab 
domains are docked into the EM density. 
The putative location of polatuzumab is c 
indicated. The densities are contoured at 
4o. (D) Modeled structure of the entire 
mlgM. Each heavy chain comprises 

five Ig-like domains (Vu, Cul, Cu2, Cu3, 
and C4), a linker Lu, and a trans- 
membrane (TM) region. Cu2, Cu4, and 
TM domains in pa interact with the 
corresponding domains in ws. For clarity, 
the ug chain is shown in cartoon. All other 
chains are represented by transparent 
surface. Two orthogonal views are shown. 
(E) Cu2, interacts with both Cu2, and 
Cu3p. The lower panel is a magnified view 
showing the interface between Cu2, 

and the surface loops of Cu3g. H-bonds 
are indicated by black dashed lines. 

(F) Magnified views of the Cu2,/Cu2, 
(top) and Cu4,/Cu4g (bottom) inter- 
faces. (G) A magnified view of the 
interface between TMua, and TMus. 

All structural figures were prepared E 
using UCSF ChimeraX (40) unless 
otherwise indicated. 


180A 


corresponding residues from TMug (Fig. 1G). 
Notably, there are two membrane-embedded, 
interchain H-bonds. The hydroxyl side chain of 
Ser*** from TM, makes bifurcated H-bonds to 
the side chains of Ser®®* and Ser®®* from TMup. 
These H-bonds may help maintain the register 
between TMi, and TMug. The residues at this 
interface are invariant among the heavy chains 
of all organisms examined (fig. S8A). 
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Unlike mIgM, sIgM exists as a pentamer or 
hexamer (19, 26). Both mIgM and sIgM share 
the same sequences in the Cy region. Struc- 
tural comparison between mIgM and sIgM 
(19) reveals a similar conformation for Cu3 
and Cu4, with a root mean squared deviation 
(RMSD) of 1.1 A over 225 aligned Ca atoms 
(fig. S8B). Outside this region, mlgM and sIgM 
display substantial structural variations. Two 


TMu, 


C-terminal B strands (known as tailpieces) of 
sIgM Cu4 mediate sIgM oligomerization (19). 
By contrast, two C-terminal a helices, TMi, 
and TMug, of Cu4 anchor mIgM on the cell 
membrane. 

The Iga/IgB heterodimer is an obligate 
component of all BCRs (4, 6). The primary 
sequences of Iga and Igf are conserved from 
fish to humans (fig. S9, A and B). The Iga/IgB 
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Fig. 2. Structure of the Iga/Igp heterodimer. (A) Overall structure of the lga/lgB heterodimer. Igo. or IgB 
comprises an ECD, a linker, and a TM. Two views are provided. (B) ECDa and ECDA are structurally similar. 
Strand D is positioned quite differently between ECDa and ECDB, however. (C) A magnified view of the 
interface between ECDa and ECDB. Cys! of ECDa and Cys'*© of ECDB form a disulfide bond. H-bonds are 
indicated by black dashed lines. The location of the disease mutation G137S in IgB is indicated by a black 
arrow. (D) The interface between TMa and TMB features several specific H-bonds. In the juxtamembrane 
region, three residues from each TM participate in a network of four H-bonds. In the middle of the TM, 
Glu“® from TMa accepts two H-bonds from the side chains of Gin‘ and Thr’©. (E) The interface between 
TMa and TMB involves extensive van der Waals contacts. 


structure resembles a pair of scissors (Fig. 2A), 
each consisting of an ECD, a linker, and a TM. 
ECDa and ECD8 share 25% sequence identity 
(fig. S9C) and have a similar conformation 
with 0.87 A RMSD over 83 aligned Ca atoms 
(Fig. 2B). The only notable difference is a 
positional shift of ~8 A for strand D. 

Iga interacts with Ig through ECDo/ECDB 
on the extracellular side and TMa/TMB within 
the membrane. Cys"? of ECDa. and Cys!”° of 
ECD8 form a disulfide bond (Fig. 2C), con- 
firming a reported finding (20, 27). The di- 
sulfide bond is buttressed by n-n stacking 
between Tyr’”” of Iga and Phe” of IgB and 
an H-bond between Gly””° of Iga and Arg”? 
of IgB. All these residues are highly conserved 
across species (fig. S9, A and B), suggesting 
conservation of the ECDa/ECDB interface. The 
agammaglobulinemia-causing mutation G137S 
in IgB8 impairs BCR assembly by reducing di- 
sulfide bond formation between Igo and IgB 
(28). In our structure, Gly'” is positioned next 
to the interchain disulfide bond (Fig. 2C) and 
the mutation G137S may perturb the local 
conformation. 
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The TMa/TMB interface registers at least 
six specific H-bonds (Fig. 2D). At the juxta- 
membrane region, the Glu!”® side chain of 
TMa accepts two H-bonds from Asn’”” and 
Lys’°® of TMB. Lys’ and Asn“? of TMa me- 
diate two H-bonds to Asp’? and Lys'®® of 
TMB, respectively. In the transmembrane 
region, Asp'** of TMa makes two H-bonds 
to Gln! and Thr’ of TMB (Fig. 2D). More- 
over, the TMa/TMf interface also features 
extensive van der Waals contacts (Fig. 2E). 
The residues involved in these interactions 
are highly conserved across species (fig. S9, 
A and B), again suggesting a conserved trans- 
membrane interface. The elaborate H-bonds 
between TMa and TMB argue against a marked 
interhelical shift in response to antigen recog- 
nition by BCR. However, this analysis does not 
rule out the possibility that antigen binding 
may lead to a new pattern of specific inter- 
actions in the TM region. 

Consistent with previous characterization 
(5, 29), mIgM noncovalently associates with 
Iga/IgB (Fig. 3A). Remarkably, the linker Lug 
traverses through a hollow structure (referred 


to as the O-ring) that is formed between Iga 
and Igf. This O-ring is sealed on the extra- 
cellular side and the membrane side by ECDa/ 
ECD and TMa/TMB, respectively. This strongly 
suggests that cofolding between the heavy chain 
of mIgM and Igo/Igf occurs during assembly of 
IgM-BCR. Consistent with the structural impli- 
cation, mIgM exhibits no detectable binding to 
the Iga/IgB heterodimer in vitro (fig. SIOA). 
Only through their coexpression and cofold- 
ing in the endoplasmic reticulum can Iga/IgB 
and mlgM assemble into an intact, functional 
IgM-BCR that is targeted to the plasma mem- 
brane (30). 

In the transmembrane region, TMu, and 
TMug are in parallel with TMB and TMa, 
respectively (Fig. 3B). The interfaces between 
TMu,/TMug and TMB/TMa are dominated by 
van der Waals contacts. Nonetheless, there are 
three buried H-bonds within the membrane- 
spanning region: between Tyr®*” of TMup and 
Thr’ of TMo, between Thr” of TMuz and 
Thr’* of TMa, and between Glu*® of TMi, 
and Gin”? of TMB. Tyr®®’ has been shown to 
be critical for complex formation between 
mIgM and Igo/Igf (32). Among these residues, 
Thr?”?/Tyr”®” of TMup and Thr“?/Thr'™ of TMa 
are invariant across species, whereas Glu°® of 
TMu, and Gin’ of TMB are invariant among 
mammals (figs. S8A and S9). 

In the juxtamembrane region, Lu, is oriented 
mainly by three positively charged residues of 
ECD through the O-ring (Fig. 3C). Specifically, 
the guanidinium group of Arg” from ECDB 
donates two H-bonds to Glu and Ala®™, and 
stacks against Phe®™ through a cation-z inter- 
action. Lys” and Are” of ECDB make H-bonds 
to Glu” and Asp*™’, respectively. Thus, Lu ap- 
pears to play a key role in the association with 
Igo/IgB, consistent with its requirement for 
BCR assembly (29). 

On the extracellular side, ECDo/ECDB stacks 
against Cu3p/Cu4g through an extended inter- 
face (Fig. 3D). On one side, Asn” and Thr” of 
ECDo. mediate H-bonds to Gln*®? and Gin*®” 
of Cu4g, respectively. The aromatic side chain 
of Trp” of ECDo. is sandwiched by Phe?”? of 
Cu3_ and Phe** of Cu4,. On the other side, 
ECDB8 only contacts Cu4;. Arg’, Lys®”, and 
Tyr® of ECDB make three H-bonds to Thr*””, 
Thr”°, and Are*” of Cu4s, respectively (Fig. 3D). 
These residues are highly conserved across spe- 
cies (figs. SSA and S9), suggesting a conserved 
mode of interaction. 

To corroborate these findings, we assessed 
the impact of mutations on IgM-BCR assembly. 
Consistent with our structural observations, 
mutations affecting the TM interfaces between 
the u chain and Iga/IgB (T573A/Y587A, F577W, 
or F581W in u chain; L152W or A156W in Igo; 
T161W in IgB) blocked IgM-BCR assembly (fig. 
S10B). Furthermore, mutations (D553A/E565A 
in Lug; R55A/K56A/R57A in ECDB) affecting 
the interface between Lug and ECD6 also 
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Fig. 3. Assembly of mlgM and Igo/lgf into intact IgM-BCR. (A) The heavy 
chain ug traverses through a hollow structure (O-ring) formed by the 
Iga/lgB heterodimer. Two perpendicular views are provided. The up chain is 
displayed in cartoon. All other chains are represented by surface. (B) The TMs 
of mlgM and Iga/lgB form a four-helix bundle. The pairwise interfaces for 
TMup/TMa and TMua/TMB are shown in the center and right panels, 


crippled IgM-BCR formation. By contrast, mu- 
tations affecting the interface between Cu4g 
and ECDa/B (Q487A/Q493A and T530A/T533A 
in u chain) had little impact on IgM-BCR for- 
mation (fig. S10B). Thus, the TMs and Lug—but 
not the ectodomains particularly—play a key 
role in the assembly of IgM-BCR. 

IgM-BCR is heavily glycosylated (27, 29). Our 
EM map allows identification of 14 glycans (fig. 
S6F). ECDoa has six N-linked glycosylation sites 
(Asn®”, Asn®, Asn”, Asn®%, Asn®”, and Asn”), 
whereas ECDB has two such sites (Asn and 
Asn’©}) (Fig. 4A). Three additional glycosylation 
sites are located on each heavy chain: Asn®”” on 
Cu2 and Asn®”?/Asn*” on Cu3 (Fig. 4B). Analysis 
of glycan distribution reveals three surface 
patches that are free of glycosylation (Fig. 4C). 
Patch-1 is formed by the surface of ECDo and 
Cu4g on one side of IgM-BCR. Patch-2, located 
on the opposite side of Patch-1, consists of the 
surface of ECDB and Cu4g. Patch-3 covers the 
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surface between ECDoa and ECD and overlaps 
with Patch-1. 

These patches represent potential epitopes 
for recognition by antibodies or miniproteins. 
Each patch contains one or more hydrophilic 
surface loops (Fig. 4D and table S2), which 
display high scores of antigenicity (32). How- 
ever, recognition by bulky antibody may be 
hindered by steric hindrance, as these loops 
are relatively short and appear to be ordered. 
Consistent with this notion, the N-terminal 
flexible sequences of Igfh constitute the only 
recognition site for polatuzumab and other 
antibodies to IgB (25). Nonetheless, smaller 
binders such as designed nanobodies and mini- 
proteins may serve as other viable alternatives. 

The residues at the interchain interfaces of 
IgM-BCR are conserved from zebrafish to hu- 
mans (figs. S8A and S9), suggesting evolution- 
ary conservation of the IgM-BCR assembly. In 
particular, key residues at the TMu,/TMug 


respectively. (C) A magnified view of the accommodation of Lug in the 
O-ring. Residues from Lug primarily engage three positively charged residues 
(Arg??, Lys°®, Arg®”) of ECDB in the juxtamembrane region. (D) ECDo/ECDB 
directly interacts with Cu3g and Cu4g. Left: Surface representation of this 
interface. Center and right: Magnified views between ECDa and Cu4s/Cu3, 
and between ECD8 and Cu4p. 


interface are almost invariant in the five mlgs. 
Thus, the register-maintaining H-bonds ob- 
served in mIgM are most likely preserved in 
the other four BCRs. Tyr”, invariant among all 
five BCRs, plays a key role in anchoring TMa 
(Fig. 3B). This strongly suggests a conserved 
interface in the TMs of all five BCRs. The 
structure of Cu4 is almost identical among 
the five mlgs (fig. SIIB) (79, 33-35). However, 
the residues at the interface between Cu4 and 
ECDa/ECDB8 show major variations (fig. S11A). 
Thus, a conserved interface between mlIg and 
Iga/IgB appears to exist in the TM region, but 
not on the extracellular side, among the five 
BCRs (36). Accordingly, the TM shows greater 
sequence conservation than the ectodomains 
(fig. S11C). 

The molecular basis for different signaling 
properties of the BCRs is partially suggested by 
structural and sequence analysis. The sequences 
in the linker and ectodomain are highly variable 
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Fig. 4. N-linked glycosylation sites in the ectodomains of IgM-BCR. 

(A) Eight glycosylation sites—six on ECDa and two on ECDB—are structurally 
identified in the Iga/lgB heterodimer. (B) Structural identification of six 
glycosylation sites in the Fc domain. Two sites are located on each Cu3 and 
one site on each Cu2. (C) Identification of three surface epitopes in IgM-BCR 
for recognition by antibody or miniprotein. Three views of the glycosylated 
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IgM-BCR are shown. The glycans are represented as magenta surfaces. 
The glycan-free surface epitopes are indicated by black dashed circles. 

(D) Magnified views of the three surface epitopes. Left: Hydrophilic surface 
loops are colored red. Right: These surface loops are enriched by charged 
amino acids. The glycan-free candidate epitopes are indicated by yellow 
dashed circles. 


among different classes of mlg. Therefore, the 
Igo/Ig8 binding mode of the ectodomain is 
likely to differ among the different BCR classes. 
This has been partially confirmed by the de- 
scribed structural variation between IgM-BCR 
and IgG-BCR (36). Such differences in ecto- 
domain engagement may constitute a form of 
conformational signaling. 

The inhibition of BCR signaling constitutes 
an effective strategy for disease treatment (12). 
Patch-1 and Patch-2 each contains a sizable 
surface area from both mIgM and Iga/IgB 
(Fig. 4D), constituting a binding site specific 
for IgM-BCR, but not for sIgM or other BCRs. 
A specific binder conjugated to an antitumor 
drug could potentially be designed for the 
depletion of IgM” B cells. Moreover, a specific 
binder could be fused to a Fc fragment for 
ligation of IgM-BCR with its inhibitory co- 
receptor, FcyRIIb, to suppress IgM-BCR sig- 
naling in chronic lymphocytic leukemia (37). 
Thus, our structure reveals concrete clues for 
future drug discovery efforts. 

During BCR activation, co-receptors such 
as CD19, CD81, and CD20 are thought to as- 
sociate with IgM-BCR in the TM region (27). 
The mutation Y587V/S588V in the TM of mIgM, 
which impairs its association with Iga/Igp, 
suppresses downstream signaling events de- 
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spite normal antigen recognition (27, 31). Thus, 
small molecules that bind the conserved TM 
region may negatively affect assembly of IgM- 
BCR and block BCR signaling. 

The stoichiometric ratio between mIg and 
Iga/IgB was originally thought to be 1:2 (4, 38), 
but subsequent studies suggested the ratio to 
be 1:1 (5, 39) (fig. S12A). In our resting-state 
IgM-BCR structure, the ECDs of Iga/Igf bind 
Cu4p, while Cu4, is close to the membrane. If 
Cu4, also binds Iga/IgB, the ECDs of Iga/IgB 
will sterically clash with the cell membrane. 
Thus, the structural features of IgM-BCR are 
incompatible with the 1:2 molar ratio (fig. S12B). 

Several models of BCR activation have been 
proposed, including the dissociation activation 
model, conformation-induced oligomerization 
model, and collision coupling model (8, 10, 11). 
Our current structure does not conflict with 
such models, and additional studies will be 
needed to definitively resolve this ongoing 
debate. For recombinant BCR in solution, the 
Fab domain is quite flexible relative to the Fc 
domain, and antigen binding is unlikely to 
transduce a conformational change from the 
Fab to the Fc domain in solution. However, 
endogenous IgM-BCR is restricted to the cell 
membrane, which is more conducive to binding- 
induced conformational changes. In addition, 


co-receptors may contribute to force transmis- 
sion. Elucidation of the activation mechanism 
may be greatly facilitated by structural determi- 
nation of the activated BCR from native B cells 
by cryo-EM and direct visualization of the 
activated BCR in cells by cryo-electron tomog- 
raphy (cryo-ET). 
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IMMUNOLOGY 


Cryo-EM structures of two human B cell 


receptor isotypes 
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Fan Zhang, Changyou Guo, Zhiwei Huang* 


The B cell receptor (BCR) complex plays a critical role in B cell development and immune responses. The 
assembly mechanisms underlying the BCR complex remain unknown. We determined the cryo-electron 
microscopy (cryo-EM) structures of human IgG-BCR and IgM-BCR, which consist of membrane-bound 
immunoglobulin molecules (mlg) and Iga/B subunits at a 1:1 stoichiometry. Assembly of both BCR complexes 
involves their extracellular domains, membrane-proximal connection peptides, and transmembrane (TM) 
helices. The TM helices of mlgG and mlgM share a conserved set of hydrophobic and polar interactions with 
lga/B TM helices. By contrast, the IgG-Cy3 and IgM-Cy4 domains interact with extracellular Ig-like domains of 
lga/B through head-to-tail and side-by-side modes, respectively. This work reveals the structural basis for 


BCR assembly and provides insights into BCR triggering. 


cell immune responses involve recog- 

nition of antigens by B cell receptors 

(BCRs) (1-3). The BCR structure com- 

prises a homodimeric antigen-binding 

membrane-bound form of immuno- 
globulin (mIg) and a membrane-bound sig- 
naling apparatus of Iga/f (CD79a/CD79b) 
heterodimer at a stoichiometry of 1:1 (4, 5). 
mlg consists of a tetramer with two identical 
membrane-bound heavy chains and two iden- 
tical light chains (fig. SIA). The four subunits 
of disulfide-linked homodimeric heavy chains 
of mlg and disulfide-linked heterodimeric Iga/B 
in the BCR complex are type I transmembrane 
proteins. The BCR can be divided into five 
major classes according to their various heavy 
chains (6-8). 
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The BCR complex plays a critical role in 
B cell development and function (9). Antigen 
engagement of the BCR induces phosphoryla- 
tion of intracellular immunoreceptor tyrosine- 
based activation motifs (ITAMs) in the Iga/B 
subunits by the kinase Lyn. The phosphoryla- 
tion event then triggers a cascade of signaling 
events, resulting in B cell activation, differenti- 
ation, and antibody production (JO). 

Several models of BCR triggering have been 
proposed (JO, 17). The cross-linking model pro- 
poses that BCRs redistribute to form micro- 
clusters for signaling upon BCR engagement 
with multivalent antigens (72). Another, the 
conformation-induced oligomerization model 
(0, 13), contends that antigen engagement of 
BCRs induces a conformational change in the 
Fc domain of mlg, thereby triggering BCR 
oligomerization and signaling (14, 15). 

Over the past few decades, many structures 
of antibodies and antigen recognition by anti- 
bodies have been reported (16-19). However, 


the organization and structure of the entire 
BCR complex has remained undetermined. 
The assembly mechanism for the BCR com- 
plex is important for our understanding of 
B cell activation but also for the development 
of effective immunotherapies targeting this 
complex. 

We subjected a human IgG-BCR (VRCO1) 
that targets the receptor binding site within 
HIV-1 gp120 (20) to cryo-electron micro- 
scropy (cryo-EM) analysis. Using a modified 
method for purification of the T cell receptor 
(TCR) (2D, we reconstituted the IgG-BCR com- 
plex containing the heavy and light chains of 
mlgG and Igoa/f subunits (fig. S1B). Three- 
dimensional reconstruction of the reconsti- 
tuted IgG-BCR complex by single-particle 
analysis produced a cryo-EM map at a reso- 
lution of 3.0 A (Fig. 1A and fig. $2). This re- 
construction enabled us to build models for 
the Fc domains, the connecting peptides 
(CPs) and transmembrane (TM) helices of 
mlgG, and the two extracellular Ig-like do- 
mains, CPs, and TM helices of Iga/ (Fig. 1B, 
fig. S4A, and table S1). 

The human IgG-BCR complex structure is 
composed of the heterodimeric Iga/f and mIgG 
with a 1:1 stoichiometry, consistent with previ- 
ous studies (4). The conical structure of the BCR 
complex measures ~145 A in height (Fig. 1B) 
with the Cy2 and Cy3 domains on top and the 
four parallel TMs forming a helical bundle-like 
structure below. The two Ig-like domains of 
Iga/B interacting with Cy3 domains and CPs 
of mIgG constitute the central component of 
the BCR complex (Fig. 1B). The Fe, which links 
the two Fabs, is positioned distal and perpen- 
dicular to the plasma membrane (PM). The 
cytosolic tails of the BCR subunits and the two 
Fab domains are not visible, presumably be- 
cause of structural flexibility (Fig. 1B). Six 
N-linked glycans at residues of asparagine 
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Fig. 1. 3D reconstruction A 
and atomic model of the 
human IgG-BCR complex. 
(A) Side view of the final 
cryo-EM map of the human 
IgG-BCR complex. Color codes 
for the subunits of the 
complex are indicated. 

(B) Different views of the 
overall structure of the 
IgG-BCR complex shown in 
cartoon. Color codes for the 
subunits of the IgG-BCR 
complex: light blue, IgG heavy 
chain_A; yellow, IgG heavy 
chain_B; green, Igo; 

magenta, lgB. 


are also well defined in the EM density (Fig. 1B 
and fig. S5). 

The two heavy chains of mIgG form a 
homodimer mediated by the Fc (residues 
Gly**' to Leu**”), CP (Ser**® to Asp*®®), and 
TM (Gly*” to Phe*®”) domains (Fig. 2A). The 
two Fc domains adopt a classic Ig fold and 
interact with one another (Fig. 2A) to form 
a classic horseshoe-shaped structure that is 
nearly identical to the ligand-free Fc domain 
of VRCO1 antibody (19) (fig. S6). Thus, the 
conformation of the Fc domain is conserved 
in both secretory antibody and membrane- 
bound forms. The homodimeric TM helices of 
the two heavy chains arrange with a helix- 
crossing angle of ~15° in the membrane. These 
two TM helices form extensive interactions 
mainly via hydrophobic contacts (Fig. 2A, 
right). Moreover, a Ser**?-mediated hydrogen 
bond is formed between TM_A and TM_B 
(Fig. 2A). The interactions between the homo- 
dimeric heavy chains are further strengthened 
by a disulfide bond, Cys**’-Cys*”, at the center 
of the CPs (Fig. 2A). 

The Iga/B heterodimer—like the CD3ye and 
CD3ée heterodimers of the TCR-CD3 complex— 
contains a pair of Ig-like domains located in 
Leu®*-Val’”° for Igo, and Ser**-Met"*? for Ig 
(Fig. 2B). The interface between the two extra- 
cellular domains of Iga/B is mainly mediated 
by a disulfide bond between Igo-Cys”? and 
Igf-Cys'*° (Fig. 2B). Two hydrogen bonds be- 
tween the main chains of Pro®’ and Gly'”° 
from Igo and the side chain of Arg”! from Ig 
at the center further stabilize the interface 
(Fig. 2B). Additionally, hydrophobic packing 
of Phe’? from Ig against its neighboring 
residues from Igo also contributes to Iga/B 
heterodimerization. The Iga_TM (residues 
Glu* to Trp'®’) and IgB_TM (residues Gln’? 
to Asp'®”) helices form a coiled-coil structure 
and make extensive hydrophobic contacts be- 
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tween them (Fig. 2B). The coiled-coil structure 
is further stabilized by polar interactions and 
van der Waals packing around the N-terminal 
sides of the two TM helices (Fig. 2B). A 
negative-charged residue Glu"“® of Iga is par- 
tially exposed to the membrane. Moreover, a 
portion of the preceding CP (residues Pro’? 
to Gly”) of Iga packs against the TM helix of 
IgB in a nearly parallel manner, further stabi- 
lizing the Iga/f interaction (Fig. 2B). A homo- 
zygous point mutation of G135S in IgB in a 
patient with immunodeficiency and defect in 
B cell development is consistent with our ob- 
servation that this mutation is located at the 
interface of Igo/B heterodimer (22). 

The extracellular assembly of the IgG-BCR 
complex involves interactions between the Ig- 
like domains of Iga/f and the disulfide bond- 
linked y_CPs (Fig. 3A). y_CP_A binds along 
the interface between the Ig-like domains of 
Iga/B, with its N-terminal portion tightly pack- 
ing against the protruding CD loop (residues 
Val® to Pro”) of Iga on one side and Trp” of 
IgB on the other (Fig. 3, B and C). Additionally, 
the two aromatic residues, Tyr“ and Trp”, 
of the CD loop make stacking contacts with 
Pro*”® residues from the two Cy3 domains. A 
main-chain hydrogen bond between Iga-Trp” 
and Cy3_B-Are*® also contributes to the CD 
loop-mediated interactions. The CD loop of 
Iga binds to the interface of homodimeric 
mlgG, thereby occluding another copy of Iga/B 
to bind to the same site. Thus, it precludes a 
2:1 stoichiometry between Igo/f and mIgG. 
The central portion of y_CP_A preceding resi- 
due Cys*” of y_CP_A makes three main-chain 
hydrogen bonds with a Bf strand (residues 
Arg”! to Lys”) of IgB, forming a B sheet-like 
structure (Fig. 3D). The C-terminal side of 
y_CP_A packs against the CP of IgB. By com- 
parison, the N-terminal side of y_CP_B largely 
interacts with Igo, whereas its central portion 


Intracellular “ 


is sandwiched between Iga/f at the membrane- 
proximal side (Fig. 3D). Assembly of the CP 
region of the IgG-BCR complex is mainly 
organized by the C-terminal side of y_CP_B. 

The two Cy3 domains are also involved in 
mediating assembly of the IgG-BCR complex. 
A short o helix (Lys*!’-Gly*”*) between the 
B_E and B_F strands of Cy3_A tightly packs 
against the N-terminal loop region of Igf 
(Fig. 3C). Two aromatic residues of IgB, Tyr®™ 
and Trp*”, stack against the N-terminal side 
of the short o, helix. The loop region (GIn’®?- 
Tyr) linking the B_F and B_G strands of 
Iga establishes contacts with the outer surface 
side of the B sheet. Furthermore, the Ig-like 
domains of Iga/f also interact with the 
N-termini of TM segments. The two intra- 
chain disulfide bonds, Cys**’-Cys*” and Iga- 
Cys"°-IgB-Cys'° (Fig. 3, A and D), further 
reinforce the ternary interaction between 
mlgG and Iga/f. The interaction between 
mlgG and the Igo/f heterodimer was greatly 
compromised by mutations of W76R, P77W, 
and R124A of Iga, F52W and W4’7A of IgB, 
or E454A, E455A, and E459A of the heavy 
chain of mIgG (fig. S7), which demonstrates 
the functional relevance of these molecular 
interactions. 

The two Iga/B_TM helices establish exten- 
sive hydrophobic interactions with the two 
y_TM helices, resulting in formation of a 
four-helix bundle structure in the membrane 
(Fig. 3E). The two face-to-face y_TM helices 
interact with the Iga/B_TM helices at one 
side, leaving the other side exposed to the 
PM. The Igo_TM helix interacts with both 
y_TM helices of mIgG, forming a three-helix 
bundle structure. By comparison, the IgB_ TM 
helix only contacts the y_TM_A helix, which 
allows the y_TM_B helix to mainly interact 
with Iga_TM helix in the membrane. The Iga/ 
B_TM segment packs against the y_TM_B 
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Fig. 2. Structures of the mlgG and Iga/B subunits. (A) Left: Side view of the structure of the two 
component chains of the mlgG homodimer. Right: Detailed interactions within the mlgG homodimer. TM and 
CP indicate transmembrane and connecting peptide, respectively. Light blue, heavy chain_A; yellow, heavy 
chain_B. (B) Structure of the lgo./B subunits. Shown in the middle is a side view of the structure. Detailed 
interactions between the two lIg-like domains and TM segments of Iga/B are shown on the left and right, 
respectively. Polar interactions are indicated by dashed lines. Iga and IgB subunits are colored green and 


magenta, respectively. 


helix via extensive hydrophobic interactions 
(Fig. 3F). The carbonyl oxygen atom of Val” 
from Igo forms a hydrogen bond with Tyr*®° 
of y_TM_B (Fig. 3F). Iga/B_TM helices con- 
tact the y_TM_A helix through hydrophobic 
packing, forming the second three-helix bundle 


structure in the PM (Fig. 3G). In addition to 
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hydrophobic residues, polar interactions located 
at the N terminus of the helix bundles of Iga/ 
6B_TM and y_TM (Fig. 3, F and G) contribute to 
the assembly of the BCR subunits in the mem- 
brane. This is reminiscent of the charged 
residues participating in TM helix interactions 
between TCR and CD3 subunits (27). Those 


residues from y_TM_A or B chain involved in 
interactions with Iga/B are conserved among 
the five classes of mIg-BCR heavy chains, which 
suggests a conserved TM structure of these 
BCR complexes. Mutations of conserved resi- 
dues mIgM-Tyr®°/Ser™ [equivalent to Tyr**°/ 
Ser of mIgG (fig. S6)] completely disrupt the 
association of mIgM with Iga/f as well as BCR 
clustering and intracellular calcium responses 
(14, 23), highlighting the critical role played by 
polar interactions in the IgG-BCR assembly. By 
contrast, mutations of mIgM-Ser®**/Thr°*? 
(equivalent to Thr*”?/Ie* of mIgG), which 
are not involved in interaction with Igo/B sub- 
units in the complex, have no effect on BCR 
signaling (24). 

IgG- and IgM-BCR have distinct sensitivities 
and thresholds to antigen engagement (25, 26), 
suggesting their structural differences. In addi- 
tion to different cytosolic tails, the flexible 
hinge region connecting Fab and Fc of mIgG 
is replaced by a Cu2 domain in mIgM (fig. S8). 
To investigate the assembly mechanism of IgM- 
BCR, we determined the cryo-EM structure of a 
human IgM-BCR complex at 3.6 A (figs. S1 and 
$3 and table S1). The quality of the density was 
sufficient for sequence assignment of the extra- 
cellular Fe domains (Cu2 to Cu4: domains), CP 
region, and TM helices of mIgM and the two 
extracellular Ig-like domains, CP region, and 
TM segments of Igo/B subunits (Fig. 4, A and B, 
and figs. S3 and S4B). Two densities adjacent to 
the Fc domain corresponding to two Fab do- 
mains of mlgM were visible at low thresholds 
(fig. S9), which suggests that they are relatively 
flexible. We identified 14 N-linked glycosylation 
sites on the extracellular domains of Iga/8 and 
mIgM (fig. S5B). 

Like Iga/f and mIgG in the IgG-BCR com- 
plex, Iga/B and mIgM have a 1:1 stoichiom- 
etry in the IgG-BCR complex. The asymmetric 
structure of mIgM_Fc (Cu2 to Cu4 domains) 
is composed of two contacting homodimers 
of Cu2-Cu2 and Cu4-Cu4 at the membrane 
distal and proximal ends (Fig. 4, A and B), 
respectively. The overall heights of IgG-BCR 
and IgM-BCR are largely similar, although the 
IgM-BCR contains two Cu2 domains (Fig. 4C 
and fig. S1A). As predicted, the Cu2 domain 
assumes a typical Ig structure and forms a 
homodimer in the IgM-BCR (Fig. 4, B and C). 
The mIgG_Fc dimer has a structure similar to 
that of mIgM_Fc, but the positioning differs 
strikingly between the two complexes (Fig. 4C). 
As a result of its shortened CPs, the Fc of 
mIgM is much more proximal to the PM as 
compared to that of mIgG. Likely for this 
reason, the arrangement of the two CPs of 
mlgM is markedly different from that of the 
CPs of mIgG. Despite the differences, the TM 
helices of IgM are positioned similarly to 
those of IgG (Fig. 4D). The overall structure 
of the Igo/B subunits in IgM-BCR resembles 
that of Iga/8 in the IgG-BCR complex, with an 
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Fig. 3. Extracellular and transmembrane assembly of the IgG-BCR complex. 
(A) Side view of the extracellular domains of the IgG-BCR complex. Red and blue 
boxes highlight the mlgG and Iga/B interfaces, respectively. Iga, IgB, heavy chain_A, 


and heavy chain_B are colored green, magenta, light blue, 


RMSD of 1.92 A over 268 Co, atoms (fig. S10A). 
In the IgM-BCR structure, the interactions 
between mIgM and Iga/f—as observed for 
the IgG-BCR—are mediated through extra- 
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(B and C) Detailed interactions of the interfaces highlighted in (A). (D) Detailed 
interactions of CP regions between mlgG and |ga/B modules. (E) Top view of the 
TM segment of the IgG-BCR complex. (F and G) The intramembrane interfaces 


and yellow, respectively. between mlgG and Iga/B modules. Hydrogen bonds are shown as dashed lines. 


cellular mIgM_Fc contact with Ig-like do- | Iga/B within the PM are largely conserved in 
mains of Iga/fB and mIgM_TM helices packing | the IgG-BCR complex (Fig. 4D and figs. S10, 
against Igo/B TM helices in the PM (Fig. 4, B | B to D, and S11). The residues of mIgM and 
and C). The interactions between mIgM and | mlgG involved in interaction with Igo/B_TM 
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Fig. 4. Structural comparison of the human IgM-BCR and IgG-BCR. (A) Side 
view of the cryo-EM map of the human IgM-BCR complex. Iga, IgB, mlgG_A, 
mlgG_B, mlgM_A, and mlgM_B are colored green, magenta, light blue, yellow, 
purple, and gold, respectively. (B) Different views of the overall structure of the 
IgM-BCR complex shown in cartoon. Color codes for subunits are the same as 


Human-migG1 
Human-migM. 
Human-migE 
Human-migA1 
Human-migD 


those shown in (A). (€) Structural comparison of IgM-BCR with IgG-BCR. 

(D) Overlaid IgM-BCR_TM and IgG-BCR_TM. Conserved residues are shown as 
sticks. Left: Sequence alignment of heavy chains from five BCR isotypes. 
Right: Conserved and similar residues are highlighted in red and yellow, 
respectively. (E) Close-up view of the extracellular domain interfaces of IgM-BCR. 


are conserved among all five mlg classes and Notable differences exist in the extracellular | only mIgM_A interacts with the Ig-like do- 
are similarly positioned (Fig. 4D). Thus, all | assembly of the IgG-BCR and IgM-BCR com- | mains of Iga/f. Moreover, the Cu4 domain of 
five isotypes of BCR may share a conserved | plexes. The mIgG homodimer plays a role in | IgM-BCR rather than the CP as seen in the 
conformation in the membrane. the extracellular assembly of IgG-BCR, whereas | IgG-BCR complex makes simultaneous contacts 
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with the Ig-like domains of Iga/B from IgM- 
BCR in a side-by-side mode (Fig. 4C), which is 
reminiscent of recognition of TCR by CD3 
subunits (27). Despite these differences, the 
Ig-like domains of Iga/B use the same surface 
side for interaction with mIgM_A. Notably, 
the protruding CD loop of Iga packs against 
the concave surface of Cu4_A (Fig. 4E) via 
hydrophobic contacts (Fig. 4E). At the periph- 
ery, Glu” of Iga and Tyr® and Lys” of Ig 
establish three hydrogen bonds and a salt 
bridge with Asn®’’, Arg”**, Gin°”, and Glu**” 
of Cu4_A (Fig. 4E), respectively, stabilizing 
the extracellular assembly of the BCR complex. 
Structural superimposition revealed that the 
CD loop of Igo. swings ~90° to contact IgG-Cy3 
and IgM-Cu4 (fig. S10A), respectively. 
IgG-BCR_Cy3 and IgM-BCR_Cu4 domains 
are required for antigen-induced BCR oligo- 
merization (73). The Cu4 and Ca3 domains 
of secretory IgM and IgA, respectively, are 
involved in pentamerization. However, the 
Cu4 domain of IgM-BCR is likely inaccessible 
for oligomerization in the resting state (13). 
Structural superposition of IgM-BCR and the 
secretory IgM (sIgM) (PDB 6KXS) (or sIgA, 
PDB 7KOC) pentamer (16, 17) revealed that 
Iga completely overlaps with the neighboring 
sIgM_Cu4 domain (fig. S12). Moreover, the 
FG loop of Iga overlaps with the FG loop of 
Cu4 of slgM_Fc2, which is involved in the 
direct contacts within each slgM_Fe monomer. 
Future studies will be needed to establish 
whether the pentamerization surface of slgM_ 
Cu4 is also required for BCR oligomerization. 
BCR clustering induced by antigen en- 
gagement is shown to be an early and critical 
event (77). Although the Cu4 domain of IgM- 
BCR is not accessible for oligomerization in 
the absence of antigen, antigen binding to 
the BCR may bring the Cu4 domain into an 
oligomerization-receptive conformation (73). 
Moreover, it has been shown that the Cu4 
domain of mIgM or the Cy3 domain of mIgG 
spontaneously forms clusters and activates 
B cells (/4). These data and our cryo-EM struc- 
tures lead us to propose that the extracellular 
domains of Iga/8 bind to Cu4 or Cy3 to steri- 
cally block mIg oligomerization in the resting 
state. Antigen engagement may impose me- 
chanical forces on the Fab domains to trigger 
the structural changes in mIg_Fc, thereby 
releasing the clustering interface of Cy3 or 
Cu4 occupied by Iga/B, leading to IgM-BCR 
clustering and downstream signaling. In con- 
trast to the conserved assembly mechanism 
between Iga/f and mIg isotypes (27), the 
different extracellular assemblies observed 
in the IgG-BCR (head-to-tail mode) and IgM- 
BCR (side-by-side mode) (Fig. 4C) complexes 
and the distinct flexibility of the Fab domains 
(fig. S9) may inform their distinct signaling 
features (28, 29). However, the underlying 
mechanism remains an open question. 
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QUANTUM PHYSICS 


Tweezer-programmable 2D quantum walks in a 


Hubbard-regime lattice 


Aaron W. Young’, William J. Eckner’, Nathan Schine’, Andrew M. Childs??, Adam M. Kaufman?* 


Quantum walks provide a framework for designing quantum algorithms that is both intuitive and 
universal. To leverage the computational power of these walks, it is important to be able to programmably 
modify the graph a walker traverses while maintaining coherence. We do this by combining the fast, 
programmable control provided by optical tweezers with the scalable, homogeneous environment 

of an optical lattice. With these tools we study continuous-time quantum walks of single atoms on a square 
lattice and perform proof-of-principle demonstrations of spatial search with these walks. 

When scaled to more particles, the capabilities demonstrated can be extended to study a variety of 
problems in quantum information science, including performing more effective versions of spatial search 


using a larger graph with increased connectivity. 


he ability of quantum systems to co- 
herently explore their Hilbert space, 
exhibiting wavelike superposition and 
interference, is a key ingredient in quan- 
tum algorithms. Quantum walks are 
one intuitive framework for understanding 
the algorithmic speedups that these ingre- 
dients can provide. In this framework, states 
in Hilbert space are mapped to the locations 
of a walker on a graph (1); the walker can then 


traverse this graph through a superposition 
of interfering paths. Even in the restrictive 
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case of real, equal-valued couplings and local, 
time-independent control, this simple framework 
is capable of universal quantum computation 
(2, 3) and has inspired the development of 
various quantum algorithms including those 
for spatial search (4), graph traversal (5), ele- 
ment distinctness (6), and formula evaluation 
(7). When scaled to many particles, systems 
that realize quantum walks further allow for 
rich studies of quantum information and 
many-body physics: controlled tunneling 
of many noninteracting particles maps to 
sampling problems of interest in complexity 
theory (8, 9) and the combination of inter- 
actions and itinerance underlies a broad class 
of condensed-matter Hubbard models (10, 11). 
In this work we introduce a platform for 
realizing programmable quantum walks and 
lattice models that combines favorable prop- 
erties of optical tweezers and optical lattices. 
We use this platform to demonstrate spatial 
search by continuous-time quantum walks 
with neutral atoms. 

Because of their broad applicability, quan- 
tum walks have been realized in a number of 
experimental platforms, including with pho- 
tons (12-14), nuclear magnetic resonance (15), 
matter waves (16), trapped ions (17, 18), and 
superconducting qubits (19, 20). Optically 
trapped neutral atoms are particularly ame- 
nable to realizing quantum walks (21, 22) 
because they allow for high-fidelity creation 


and detection of individual, physically identi- 
cal walkers. One approach to studying quan- 
tum walks of neutral atoms is with quantum 
gas microscopes, which load degenerate gases 
containing thousands of particles into optical 
lattices containing thousands of sites (23, 24). 
Sophisticated techniques have been developed 
to “cookie-cut” desired initial states out of these 
complicated many-body states (22, 25, 26). A 
complementary approach is to rapidly assem- 
ble such states with optical tweezer arrays. In 
this case it is possible for individual atoms to 
be deterministically assembled into nearly 
arbitrary geometries (27, 28) and rapidly laser- 
cooled to their three-dimensional (3D) motional 
ground state (29-35). Pioneering studies have 
used tweezers to explore tunneling between 
up to eight sites (26, 36, 37); however, such 
systems are sensitive to disorder, making it 
difficult to realize coherent itinerance across 
many sites. We present an alternative solution 
that uses optical tweezers and high-fidelity 
laser cooling for fast, programmable implanta- 
tion and control of single atoms in a Hubbard- 
regime optical lattice. 

We use this approach to study quantum 
walks of individual atoms spanning hundreds 
of sites in a 2D lattice and to locally control 
those walks to realize a spatial search algo- 
rithm. In this case, elements of the search 
space are represented by sites in the lattice; 
the location of the atom, or walker, is ini- 


tialized through tweezer implantation and 
the search oracle is created with dynamically 
programmable tweezers superimposed on the 
lattice (4). The techniques demonstrated here 
may advance studies of nonequilibrium and 
ground-state Hubbard physics, where fast 
cycle times, versatile state preparation, and site- 
resolved potentials can advance entanglement- 
measurement protocols and implement certain 
sampling problems (8, 9, 38-40). These tools 
could be further extended to other types of 
systems, like molecules (34, 4), that are less 
amenable to evaporative cooling but are pow- 
erful in terms of many-body physics and quan- 
tum information. 

Our experiments begin by trapping and 
cooling individual strontium (°*Sr) atoms in 
optical tweezers (32). The atoms are then 
transferred into one 2D layer of a 3D optical 
lattice (Fig. 1A). This transfer need not be fully 
adiabatic with respect to on-site motional 
timescales because the tightly confining op- 
tical lattice allows for higher-fidelity optical 
cooling than can be performed in the tweezers, 
yielding a typical 3D motional ground state 
fraction of 100*?% (42). This lattice is composed 
of a 1D crossed-beam lattice which is aligned 
along gravity and a 2D bowtie lattice. The 
depths of these two lattices are indepen- 
dently tunable and are reduced to 26.0E;* 
and 5.0E2 , respectively, to study tunneling 
after optically cooling the atoms, where Ee 
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Fig. 1. Continuous-time 2D quantum walks with tweezer-implanted atoms 
in a lattice. (A) Individual ®8Sr atoms (solid blue sphere) are loaded and 
cooled in optical tweezers (preparation/oracle tweezer, green), and then 
implanted into single sites of a 3D optical lattice composed of a 1D crossed-beam 
lattice aligned along gravity (axial lattice, red half-disk), and a bowtie lattice 
(2D lattice, red). The 2D lattice contains more than 2000 sites that are 
compatible with high-fidelity imaging and ground state cooling (42) and is 
tunable to a regime where nearest-neighbor sites are coupled with a tunneling 
energy J, allowing the atoms to move through the lattice (translucent blue 


Tunneling time (ms) 


exhibits ballistic expansion and wavelike interference; this is in good agreement 
with theory with fitted values of the tunneling energy (right panels) up to an 
evolution time of 5 ms, where the atoms have coherently explored a region 
spanning ~200 lattice sites. Each pixel in these plots represents a single lattice 
site. (C) Tracing out the x axis of this 2D quantum walk yields a 1D quantum 
walk along the y axis (left panel), which is in good agreement with theory 
(right panel). Sampling times in these plots correspond to the leading edge of 
each colored region. For data to the left of the red line, multiple atoms can 

be implanted in different regions of the lattice for faster data collection (42). 


spheres). The preparation tweezers used to implant atoms can further be used to 
programmably modify the depth of individual sites in this lattice and a large- 
waisted tweezer (confinement tweezer, pink) can be used to apply a tunable 
harmonic confining potential that spans many lattice sites. (B) Atoms implanted 
in this lattice undergo continuous-time quantum walks in 2D, such that the 
probability density p corresponding to their measured position (left panels) 
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Here, each point in time is averaged over 200 repetitions of the experiment. For 
data to the right of the red line a single atom is implanted in the center 

of the lattice to avoid overlapping atomic wavefunctions and averaging over 
inhomogeneous regions in the lattice. Here, each point up to and including 

3.5 ms is averaged over 3000 repetitions. The points at 4 and 5 ms are averaged 
over 6000 and 14,000 repetitions, respectively. 
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and Ez” are the recoil energies of the two re- 
spective lattices (42). The detuning between 
sites in the axial lattice caused by gravity sup- 
presses tunneling along this axis, whereas atoms 
tunnel freely in the 2D lattice with an ave- 
rage tunneling energy of Jo/h = 2n x 163 Hz, 
corresponding to a characteristic tunneling 
time of t= //Jo = 0.975 ms (the tunneling 
energy differs slightly between the two axes of 
the lattice) (42). 

The evolution of the system in this regime 
can be understood as a quantum walk ona 
graph where each site |7) in the lattice is rep- 
resented by a node, and nodes corresponding 
to tunnel-coupled sites are connected by an 
edge. Such a graph can be represented by its 
adjacency matrix A, where A, = 0 unless nodes 
i and j are connected in which case A;; = 1. 
Given this definition the Hamiltonian of the 
system is: 


at = J) Asli) GI s » Vilé)(z|_— (A) 


where J is the tunneling energy; we have also 
included a local energy shift V; that is present 
as a result of the finite extent of our lattice 
beams (42) and can be programmably modi- 
fied through two sets of optical tweezers (Fig. 
1A). If the tweezers are fully extinguished, 
given the lattice uniformity, |V;|<J and this 
term can be disregarded. In this case, an atom 
implanted in one site of the lattice undergoes 
a continuous-time quantum walk in 2D (Fig. 
1B). The evolution of this atom’s wavefunc- 
tion is in good agreement with the theoretical 
prediction for a flat lattice with constant V; 
and a distant boundary, exhibiting the for- 
mation of fringes in the probability density p 
of the atom’s measured position thanks to 
interference between the multiple paths by 
which the atom can arrive at a given site after 
its evolution. This is in contrast to the behavior 
of classical random walks, which exhibit diffu- 
sive expansion of a Gaussian probability den- 
sity distribution. We can trace out one of the 
atom’s two spatial coordinates (Fig. 1C) which, 
given the form of H,, for a 2D square lattice, 
results in a 1D quantum walk along the re- 
maining axis. The resulting data are in good 
agreement with theory, showing the expected 
ballistic or light-cone-like spreading of the 
atomic probability density (7). For the latest 
time shown here of ¢t = 5.0 ms, the probability 
density continues to exhibit clear interference 
fringes (42), suggesting that the atom has 
maintained phase coherence while exploring 
a region spanning ~200 lattice sites. At later 
times the atom begins to sample the inho- 
mogeneous potential resulting from the finite 
extent of the lattice beams and the infinite flat 
lattice approximation breaks down (42). 

This coherent exploration of Hilbert space 
through continuous-time quantum walks can 
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Fig. 2. Adiabatic resource state preparation. (A) An atom implanted in the site with the lowest energy 

in a deep lattice with no tunneling (left callout), which we label as being in state |0), can be adiabatically 
connected to the ground state |s) of a shallow lattice with tunneling (right callout) through an adiabatic 
ramp of the tunneling energy (J) and the depth (Vc) of the confinement tweezer. In the callouts the 

gray dashed line denotes the potential provided by the confinement tweezer and the solid gray line indicates 
the sum of the tweezer and lattice potentials. The preparation tweezer used to initially implant the atom 

in |O) is shown schematically in green. By ramping J and Vc together we can maintain a roughly fixed energy 
gap AE between the ground and first excited states of the system throughout the ramp. (B) As an atom 
evolves under this ramp its amplitude spreads over many sites in the 2D lattice. Here, the x coordinate has 
been traced out for illustration purposes, showing the spread in the atom's y position (top). During this ramp, 
the population on the initial site (gray points) decreases and the overlap f (see main text) between the 
classical probability distributions corresponding to the prepared state |w) and the expected lattice ground 
state |s) (black points) increases, eventually reaching 76.9(3.3)%. This is in reasonable agreement with 
theory (solid lines) given the independently measured parameters in our ramp, and an overall scale factor to 
account for filtering and loss due to imperfect preparation of the atoms in their 3D motional ground 

states (42). The prepared state is not observed to substantially evolve even after more than 100 tunneling 


times (right side of the broken axis), suggesting that it is indeed the lattice ground state. 


Each data 


point corresponds to 500 repetitions of the experiment except for the points at 70 and 180 ms which are 


averaged over 1500 repetitions. 


be harnessed in a variety of quantum algo- 
rithms including those for spatial search 
(4). Spatial search is a specialization of the 
unstructured search problem with additional 
constraints on how the space can be explored. 
A continuous-time analog of Grover’s search 
algorithm (43) performs search by quantum 
walk in the limiting case of a fully connected 
graph. The problem becomes more difficult 
when edges are removed from the graph, 
which may preclude the quadratic Grover 
speedup depending on the graph structure. 
Surprisingly, quadratic speedup can persist 
even for much less well connected graphs. 
In particular, for N-vertex square lattice 
graphs the search can be performed in time 
O(/N polylog N ) in only four dimensions (4) 
whereas the full, optimal (43) quadratic speedup 
is recovered in five or more dimensions (4). 
Although the behavior of these single-particle 
walks can be captured by a classical wave 
equation involving N coupled oscillators (44), 
in the specific setting of searching a memory 
that is distributed in real space with a local 
probe (45), quantum walks of even a single 
particle can provide a notable quantum ad- 
vantage (42). 


In these algorithms the lack of structure 
in the search space suggests that a natural 
starting point is the uniform superposition 


|S) = ) l)/VN over all standard basis 


states in the relevant Hilbert space. Assuming 
periodic boundary conditions, this resource 
state |S) is precisely the ground state of Hy at 
with constant V;. To approximate this resource 
state we prepare the ground state |s) of Hyat 
(Fig. 2) in the presence of a potential V; pro- 
vided by an extra confinement tweezer (Fig. 1A). 
This tweezer has a nearly Gaussian profile 
with a fixed waist of 5.8a, where a is the 2D 
lattice spacing, and tunable overall depth 
Vo. At fixed J, the lattice ground state |s) is 
similar to |S) except with an approximately 
Gaussian envelope with a width determined by 
the value of Vo. 

To prepare the state |s) we implant an atom 
in the deepest site of the combined potential 
generated by the lattice and the confinement 
tweezer, which we label as being in state |0). 
This is the ground state of the system when 
the lattice is deep andJ«V¢. The state |0) can 
be adiabatically connected to the ground state 
|s) in a shallow lattice through a ramp of the 
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Fig. 3. Spatial search by continuous-time quantum walks. (A) When applying an oracle Hamiltonian 

Hy, = —Vw\w)(w| through a tweezer (green) that selects for an arbitrary marked site |w) with appropriate 
strength, the spectrum of the system includes ground and first excited states that are approximately 

even and odd superpositions of |w) and the lattice ground state or resource state |s) (42). Consequently, 
evolving |s) under Hy results in coherent oscillations between |s) and |w). (B) Measurements of the resulting 
oscillations in the populations on the marked sites (black points) are in good agreement with theory, up 

to an overall offset, with independently characterized values of the state preparation fidelity, tunneling 


ate, and confinement potential (black lines) (42). For comparison we also plot the population in |0), at the 
center of the lattice (gray points and theory curves). (Insets) measured populations versus position at 

the peak of these oscillations where the location of the relevant oracle is marked by a red point and the 
attice center by a cross. (C) The population on the marked site after a 2.46 ms quench is plotted for a 
selection of oracles (the lattice center is marked by a cross, and the position of each oracle by a red 
point, with the corresponding population shown in the surrounding circle. For comparison, the measured 
amplitudes in |s) are shown in the background). (D) Plotting the amplitudes from (C) as a function of oracle 


distance from the lattice center (black points, a = 813/./2 nm refers to lattice spacing) reveals quench 
behavior that is in good agreement with a theory prediction (black curve) with no free parameters (42). 


In a region spanning ~13 lattice sites the marked site can be found by looking for the highest amplitude site 
after the quench. At longer distances the amplitude on the marked site still increases from its value in |s) 
(theoretical and measured amplitudes in |s) shown in dark gray shading and points, respectively), but can be 
exceeded by the population at the origin in |s) (light gray shading), limiting the size of the search space. 
Color scale is shared across this figure and all data points are averaged over 500 to 600 repetitions 


of the experiment. 


tunneling energy (Fig. 2A). In practice we 
perform a linear ramp of the lattice depth Vy, 
as a function of time ¢ resulting in a nonlinear 
ramp in the tunneling energy (42). We also 
ramp the depth V¢ of the confinement tweezer 
to maintain an approximately constant value 
of the energy gap AF between the ground and 
first excited states during the ramp, which subs- 
tantially relaxes the requirements on adiabatic- 
ity and improves the fidelity with which we 
can prepare |s). The observed evolution during 
this ramp is in reasonable agreement with 
theory (Fig. 2B), where the atoms start out 
primarily in |0) and smoothly delocalize over 
many sites over the course of an adiabatic 
ramp of Vc and J that is 80 ms in length. The 
prepared state |) can be compared with |s) by 
computing the overlap between their pop- 
ulations, namely with the classical fidelity 


2 
i= (So. vPvPor) , where py, and p,; de- 


note the populations on site 7 in states |y) and 
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|s), respectively. This yields an upper bound on 
the fidelity with which we have prepared |s) 
of 76.9(3.3)% but does not certify any phase 
coherence between the amplitudes occupy- 
ing different sites. However, the prepared 
state is not observed to substantially evolve 
over more than 100 tunneling times and the 
adiabatic ramp can be reversed to recover 
57(5)% of the atoms in |0) (excluding loss 
caused by filtering of hot atoms) (42). This 
suggests that |s) has been prepared with a 
fidelity of 76(3)% in agreement with the bound 
set by the classical fidelity; we can thus pro- 
ceed with the search procedure. Note that in 
this work and the associated supplementary 
materials all confidence intervals refer to one 
standard error of the mean. 

The central idea in quantum-walk-based 
search algorithms is the presence of two com- 
peting terms in the Hamiltonian: a diffusion 
term Hj, corresponding to tunneling, whose 
ground state is |s), and an oracle term Hy = 


—V,»|w)(w|, whose ground state marks a 
specific site |w). H,, can be applied to the 
system with variable drive strength V,, but the 
choice of |w) is unknown to the experimenter 
and is the quantity being searched for. Given a 
sufficiently connected graph and appropriate 
choice of V,., the states |s) and |w) are similar 
in energy and coupled under the full search 
Hamiltonian H = Hy, + Hy. This results in 
ground and first excited states |+) that are 
approximately the even and odd superpositions 
of |s) and |w), and are separated by an energy 
gap AE = O(1/VN), where N is the number 
of elements in the search space (Fig. 3A) (42). 
As a result quenching to this Hamiltonian 
from the resource state |s) yields coherent 
oscillations to |w) and back with a characteris- 
tic period of O(N) that is independent of 
the specific choice of |w). Measuring the 
position of the walker after a half period of this 
oscillation identifies the marked site |w). This 
procedure can also be run backward to pre- 
pare |s) from an atom implanted in a pre- 
determined site |w’), avoiding any additional 
overhead associated with adiabatic resource 
state preparation (42). 

The choice of V,, must be carefully fine- 
tuned to minimize the energy gap between |+) 
given N and the connectivity of the graph (4). 
Here we choose V,. = 12.55(65)Jo, which is 
biased deeper than the optimal value to avoid 
certain sources of technical noise (42). Even 
with optimal V,, in a 2D square lattice with 
cyclic boundary conditions |+) deviate from the 
even and odd superpositions of |s) and |w) and 
the scaling of AE with N is modified, resulting 
in an asymptotic runtime of O(N /polylog NV) 
(4). This scaling is further modified by the 
nonperiodic boundary conditions in our ex- 
periment (42). Nonetheless, upon quenching 
to H starting in the state |s), we observe 
coherent oscillations in the population on the 
marked site |w) for a selection of different 
oracles (Fig. 3B). At the peak of these oscilla- 
tions the marked site is readily identified as 
the highest amplitude site in the lattice. Crit- 
ically, the amplitudes of these oscillations are 
in good agreement with theory and limited in 
magnitude not by technical noise but rather 
by the expected performance of this search 
procedure in a 2D square lattice (42). 

It should be noted that in the case of open 
boundary conditions or in the presence of 
the confinement tweezer the behavior of these 
oscillations is dependent on the specific posi- 
tion of |w) (42). This position-dependent 
behavior sets the effective size of the search 
space where at greater range reduced overlap 
between |w) and |s) yields oscillations with 
reduced amplitude. In our experiment, for 
an oracle with a distance of \/2a from the 
center of the confinement tweezer (the origin) 
the optimal evolution time after the quench is 
2.46 ms. Performing this quench and evolution 
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for a variety of different oracles (Fig. 3C), we 
find that the amplitude on the marked site 
after the evolution decreases as a function of 
distance from the origin, in agreement with 
theory (Fig. 3D). Within 2a—corresponding 
to aregion spanning ~13 lattice sites—we can 
blind ourselves to the position of the oracle 
tweezer and identify the marked site by look- 
ing for the most probable location of the walker 
after the quench. At longer range the amplitude 
at the origin after the quench and subsequent 
evolution can exceed that of the marked site. 
However, within //13a—corresponding to a 
region spanning ~45 lattice sites—there is still 
a several-fold increase in the amplitude on the 
marked site relative to what was present in the 
resource state. This suggests that the effective 
size of the search space could be increased 
with constant overhead by measuring the am- 
plitude of these oscillations over time. Although 
not demonstrated here this would help to reject 
any background signal that remains near the 
origin or effects relating to variable oscillation 
frequencies for different oracles. 

In this work we have performed a proof- 
of-principle demonstration of spatial search 
through continuous-time quantum walks of 
a single particle on a 2D square lattice. This 
is accomplished by introducing a platform 
that combines the programmability of optical 
tweezer arrays with a Hubbard-regime optical 
lattice which provides a clean environment for 
tunneling in addition to several thousand sites 
which are compatible with high-fidelity cool- 
ing, imaging, and coherent control (42, 46). 
Beyond studies with itinerance, these capa- 
bilities can also be used to prepare large, well- 
controlled ensembles of atomic qubits for 
quantum information, simulation, and metrol- 
ogy (35, 46, 47, 48, 49, 50). 

In the specific context of spatial search the 
runtime of the algorithm demonstrated in this 
work does not exhibit a quadratic speedup in 
comparison to classical search algorithms as a 
result of our use of a 2D square lattice (4). A 
runtime of O(VNlogN) is achievable with a 
single particle in such a lattice if an additional 
spin degree of freedom is used to implement 
a Dirac Hamiltonian (57) or a discrete-time 
quantum walk (52). This degree of freedom 
can be either internal to the walker or external 
and realized with a modified optical lattice 
containing an array of doublets (53). The op- 
tical clock qubit in strontium is a strong can- 
didate for implementing this spin internally 
because it is well controlled (46) and long lived 
compared with the tunneling time (35). More- 
over, it is possible to engineer state-dependent 
optical potentials for this qubit (54) to realize a 
broad class of discrete-time quantum walks 
(21, 55). 

Although there is a setting in which such 
single-particle quantum walks can provide a 
uniquely quantum advantage (45), this advan- 
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tage can be extended to a broader class of 
problems by realizing these dynamics in a 
system whose state space scales rapidly with 
physical resources. This can be achieved by 
extending this work to multiple particles, 
where the state space and thus graph size 
grows exponentially with particle number (42). 
Given the cooling performance and single- 
particle control demonstrated here such ex- 
periments could be performed with tens to 
hundreds of atoms, where the appropriate 
many-body oracle is applied through tunable 
Rydberg-mediated interactions (42, 46). Beyond 
spatial search the programmable control and 
assembly of large-scale itinerant systems en- 
abled by this platform provides one route 
toward programmable boson sampling with 
many particles (8, 9) as well as the direct 
assembly and characterization of Hubbard 
models (56, 57). 
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Omicron spike function and neutralizing activity 
elicited by a comprehensive panel of vaccines 


John E. Bowen'+, Amin Addetia't, Ha V. Dang”, Cameron Stewart’, Jack T. Brown’, William K. Sharkey’, 
Kaitlin R. Sprouse’, Alexandra C. Walls", Ignacio G. Mazzitelli*, Jennifer K. Logue®, Nicholas M. Franko°, 
Nadine Czudnochowski’, Abigail E. Powell, Exequiel Dellota Jr.2, Kumail Ahmed®, Asefa Shariq Ansari®, 
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Najeeha Talat Iqbal®, Davide Corti’, Jorge Geffner*, Gyorgy Snell”, Renata Grifantini“*, 


Helen Y. Chu®, David Veesler?>* 


The severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) Omicron variant of concern 
comprises several sublineages, with BA.2 and BA.2.12.1 having replaced the previously dominant BA.1 
and with BA.4 and BA.5 increasing in prevalence worldwide. We show that the large number of 
Omicron sublineage spike mutations leads to enhanced angiotensin-converting enzyme 2 (ACE2) binding, 
reduced fusogenicity, and severe dampening of plasma neutralizing activity elicited by infection or 
seven clinical vaccines relative to the ancestral virus. Administration of a homologous or heterologous 
booster based on the Wuhan-Hu-1 spike sequence markedly increased neutralizing antibody titers 

and breadth against BA.1, BA.2, BA.2.12.1, BA.4, and BA.5 across all vaccines evaluated. Our data suggest 
that although Omicron sublineages evade polyclonal neutralizing antibody responses elicited by primary 
vaccine series, vaccine boosters may provide sufficient protection against Omicron-induced severe disease. 


he ongoing COVID-19 pandemic has led 

to the emergence of severe acute respira- 

tory syndrome coronavirus 2 (SARS-CoV-2) 

variants with increased transmissibility, 

viral fitness, and immune evasion (J-10). 
The most recently named variant of concern, 
Omicron, is characterized by the greatest known 
genetic divergence from the ancestral virus 
(Wuhan-Hu-1) and consists of several subline- 
ages, including BA.1, BA.2, BA.3, BA.4, and 
BA.5. BA.1 was first detected in late 2021 and 
rapidly replaced Delta to become the globally 
dominant SARS-CoV-2 strain (3, 9, 11), aided 
by its high transmissibility and escape from 
neutralizing antibodies (6, 12-78). In early 
March of 2022, BA.2 became the most prev- 
alent SARS-CoV-2 variant globally (19, 20) 
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(Fig. 1A), and the proportion of BA.2.12.1in 
sequenced viruses peaked at >30% worldwide 
and >60% in the United States by late May of 
2022 (Fig. 1B). However, BA.4 and BA.5, which 
share the same spike (S) glycoprotein sequence, 
are expected to reach global dominance owing 
to their increasing prevalence and success- 
ful replacement of BA.2 in South Africa (27) 
(Fig. 1C). 

The receptor-binding domain (RBD) of the 
SARS-CoV-2 S glycoprotein interacts with the 
receptor angiotensin-converting enzyme 2 
(ACE2) (22-26), promoting S conformational 
changes that lead to membrane fusion and 
viral entry (27-29). S is the main target of 
neutralizing antibodies, which have been 
shown to be a correlate of protection against 
SARS-CoV-2 (30-38), with RBD-targeting anti- 
bodies accounting for most neutralizing activ- 
ity against vaccine-matched virus (36, 38) and 
nearly all cross-variant neutralizing activity 
(39). SARS-CoV-2 vaccines are based on the 
S glycoprotein [sometimes the RBD only 
(30, 40, 41)] or (inactivated) virus, and they 
utilize a variety of delivery technologies. Lipid- 
encapsulated prefusion-stabilized S-encoding 
mRNA vaccines include Moderna mRNA-1273 
and Pfizer-BioNTech BNT162b2. Viral-vectored 
vaccines encoding for the SARS-CoV-2 S se- 
quence include Janssen Ad26.COV2.S (human 
adenovirus 26), AstraZeneca AZD1222 (chim- 
panzee adenovirus), and Gamaleya National 
Center of Epidemiology and Microbiology 
Sputnik V (human adenovirus 26 and 5 for 
prime and boost, respectively). Novavax NVX- 
CoV2373 is a prefusion-stabilized S protein 
subunit vaccine formulated with a saponin- 
based matrix M adjuvant, whereas Sinopharm 


BBIBP-CorV comprises inactivated virions. The 
primary vaccine series consisted of two doses 
for all of these vaccines except for Ad26.COV2. 
S, which was administered as a single dose. 

We first aimed to understand how the dif- 
ferent S mutations in the Omicron variant sub- 
lineages affect host receptor engagement and 
membrane fusion. Whereas the Delta RBD 
recognized human ACE2 with a comparable 
affinity to that of the Wuhan-Hu-1 RBD [1.1- 
fold enhancement by biolayer interferometry 
(BLI) (3) and 1.5-fold enhancement by surface 
plasmon resonance (SPR)], the ACE2 binding 
affinity was greater for the BA.1 RBD (4.4-fold 
by BLI and 2.6-fold by SPR) (4, 72, 15) and for 
the BA.2 RBD (3.7-fold by BLI and 2.3-fold by 
SPR) (Fig. 1, D and §&; figs. S1 and $2; and 
tables S1 and S2). The BA.2.12.1 RBD—which 
differs from the BA.2 RBD only by the L452Q 
(substitution of leucine for glutamine at po- 
sition 452) mutation—had an ACE2 binding 
affinity similar to that of the Wuhan-Hu-1 
RBD (1.1-fold and 1.7-fold enhancements 
determined by BLI and SPR, respectively). 
The ACE2 binding affinity of the BA.4/BA.5 
(BA.4/5) RBD was the greatest among the 
RBDs evaluated in this work, with 6.1-fold 
and 4.2-fold increases relative to Wuhan-Hu-1, 
as determined by BLI and SPR, respectively 
(Fig. 1, D and E; figs. S1 and S2; and tables S1 
and S82). 

We next compared the kinetics and mag- 
nitude of cell-cell fusion mediated by the 
Wuhan-Hu-1/G614, Delta, BA, BA.2, BA.2.12.2, 
and BA.4/5 S glycoproteins using a split green 
fluorescent protein (GFP) system (42) with 
VeroE6/TMPRSS2 (VeroE6 cells stably ex- 
pressing TMPRSS2) target cells (expressing 
GFP B£ strands 1 to 10) and BHK-21 effector 
cells (expressing GFP B strand 11) and tran- 
siently transfected with S. We observed slower 
and markedly reduced overall fusogenicity for 
all tested Omicron sublineage S glycoproteins 
compared with Wuhan-Hu-1/G614 S and even 
more so relative to Delta S (75, 19) (Fig. 1, F to 
H; fig. S3; and movies S1 to S6), despite the 
higher apparent BA.4/5 S cell surface expres- 
sion compared with other S trimers (fig. S4). 

We next assessed the plasma neutralizing 
activity elicited in humans by each of the 
seven vaccines or SARS-CoV-2 infection and 
evaluated the immune evasion associated with 
the constellation of S mutations present in the 
BA.1, BA.2, BA.2.12.1, and BA.4/5 Omicron sub- 
lineages (table S3). We measured entry of vesic- 
ular stomatitis virus (VSV) pseudotyped with 
the SARS-CoV-2 Wuhan-Hu-1 S harboring 
the D614G, BA.1, BA.2, BA.2.12.1, or BA.4/5 
mutations into VeroE6/TMPRSS2 target cells 
(43) in the presence of vaccinee or convales- 
cent plasma (table $4). Plasma was obtained 
from individuals previously infected with a 
Washington-1-like SARS-CoV-2 strain based 
on time of infection. These samples were 
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Fig. 1. Omicron sublineage RBDs bind ACE2 with enhanced affinity but 
exhibit impaired S-mediated fusogenicity relative to the ancestral virus. 

(A to C) Prevalence of the different variants of concern measured globally (A), in 
the United States (B), or in South Africa (C). Alpha comprises B.1.1.7 and all 

Q sublineages; Delta comprises B.1.617.2 and all AY sublineages; and BA.1, BA.2, 
BA.4, and BA.5 comprise their respective sublineages (including BA.2.12.1 for 
BA.2). Prevalence calculations rely on shared GISAID (Global Initiative on Sharing 
nfluenza Data) sequences and may be biased by sampling. (D and 

E) Equilibrium dissociation constants (Kp) of binding of the monomeric 
ACE2 ectodomain to immobilized biotinylated Wuhan-Hu-1, Delta, BA.1, 

BA.2, BA2.12.1, and BA.4/5 RBDs assessed by BLI (D) or SPR (E). Data 
presented are the results of at least two independent biological replicates 


Avian 


obtained early in the pandemic, so individuals 
had not been vaccinated. We determined a 
plasma neutralizing geometric mean titer 
(GMT) of 39 against Wuhan-Hu-1/G614 VSV S 
pseudovirus, and only 5 of 24 individuals had 
detectable, albeit very weak, neutralizing ac- 
tivity against any of the four tested Omicron 
sublineages (Fig. 2 and fig. S5A). Plasma from 
subjects that received two doses of Moderna 
mRNA-1273 ~4 weeks apart had Wuhan-Hu-1/ 
G614, BA.1, BA.2, BA.2.12.1, and BA.4/5 S VSV 
neutralizing GMTs of 633, 33, 44, 30, and 22, 
respectively, whereas plasma from subjects 
that received two doses of Pfizer BNT162b2 
~3 weeks apart had neutralizing GMTs of 340, 
20, 29, 24, and 19, respectively (Fig. 2 and fig. 
S5, B and C). In total, 19 of 28, 21 of 28, 19 of 
28, and 16 of 28 mRNA-vaccinated subjects 
retained neutralizing activity against BA.1, BA.2, 
BA.2.12.1, and BA.4/5 S VSV, respectively. The 
combined Moderna and Pfizer cohorts experi- 
enced =18-fold, =13-fold, =17-fold, and =23-fold 
GMT reductions against BA.1, BA.2, BA.2.12.1, 
and BA.4/5 S VSV, respectively. A similar trend 
was observed for plasma from individuals that 
received two doses of Novavax NVX-CoV2373 
(44) in a double-blinded manner; however, 
these plasma samples were not obtained at 
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peak titers owing to the design of the clinical 
trial (figs. S6 and S7). From this group, we de- 
termined a neutralizing GMT of 252 against 
Wuhan-Hu-1/G614 S VSV with only 2 of 10 in- 
dividuals having detectable neutralizing activ- 
ity against BA.1 (GMT: 12, =22-fold drop), 7 of 
10 against BA.2 (GMT: 15, =16-fold drop), 4 of 
10 against BA.2.12.1 (GMT: 13, =20-fold drop), 
and 1 of 10 against BA.4/5 (GMT: 11, =23-fold 
drop) (Fig. 2 and fig. S5D). Plasma from indi- 
viduals vaccinated with Janssen Ad26.COV2.S 
were obtained 9 to 142 days (mean, 79) after 
their single dose—a time frame expected to 
capture peak neutralizing titers (45). This re- 
sulted in a Wuhan-Hu-1/G614 S VSV GMT of 
55, and only 1 of 12 subjects had detectable 
plasma neutralizing activity against any of 
the Omicron sublineages (Fig. 2 and fig. S5E). 
Two doses of AZD1222 4 weeks apart induced 
Wuhan-Hu-1/G614, BA.1, BA.2, BA.2.12.1, and 
BA.4/5 S VSV neutralizing GMTs of 165, 14, 
19, 15, and 14, respectively, with 13 of 16 and 4 of 
16 individuals having detectable neutralizing 
activity against any or all tested subvariants, 
respectively (Fig. 2 and fig. SSF). Sputnik V vac- 
cinee plasma after two doses had Wuhan-Hu-1/ 
G614, BA1, BA.2, BA.2.12.1, and BA.4/5 S VSV 
GMTs of 69, 13, 17, 14, and 11, respectively (Fig. 2 


Time after coculture (hrs) 


for BLI and for SPR (except for the BA.1 RBD SPR data, which come from 
four technical replicates). (F) Quantification of cell-cell fusion after 24 hours 
mediated by Wuhan-Hu-1/G614, Delta, BA.1, BA.2, BA.2.12.1, and BA.4/5 S 
glycoproteins expressed as the fraction of the total area with GFP 
fluorescence assessed using a split GFP system. Data are from 16 fields of 
view from a single experiment and are representative of results obtained 
from two independent biological replicates. Comparisons between fusion mediated 
by the Wuhan-Hu-1/G614 S and other S variants were completed using the 
Wilcoxon rank sum test. **P < 0.01; ***P < 0.001; ****P < 0.0001. (G) Kinetics 
of cell-cell fusion mediated by Wuhan-Hu-1/G614, Delta, BA.1, BA.2, BA.2.12.1, 
and BA.4/5 S glycoproteins expressed as the fraction of the total area with 
GFP fluorescence assessed using a split GFP system. 


and fig. S5G). Detectable neutralizing activ- 
ity against any or all Omicron sublineages was 
observed for 7 of 13 and 2 of 13 individuals, 
respectively. Finally, plasma from subjects vac- 
cinated with two doses of Sinopharm BBIBP- 
CorV had a neutralizing GMT against G614 
S VSV of 135, with 3 of 12 samples retaining 
detectable neutralizing activity against BA.1 
(GMT: 14), 7 of 12 against BA.2 (GMT: 17), 5 of 
12 against BA.2.12.1 (GMT: 15), and 4 of 12 
against BA.4/5 (GMT: 11) (Fig. 2 and fig. S5H). 
Overall, these data underscore the magnitude 
of evasion of polyclonal plasma neutralizing 
antibody responses for Omicron sublineages 
in humans after primary vaccine series or in- 
fection [resulting from the accumulation of 
S mutations (72)], with a subtle but consis- 
tently more marked effect for BA.1 and even 
more so for BA.4/5 compared with BA.2 and 
BA.2.12.1. 

The emergence of the SARS-CoV-2 Delta 
and subsequently Omicron variants of con- 
cern led to an increasing number of reinfections 
and vaccine breakthrough cases (5, 46-48). 
Public health policies were therefore updated 
worldwide to recommend administration of 
an additional vaccine dose (booster) several 
months after the primary vaccine series, which 


19 AUGUST 2022 * VOL 377ISSUE 6608 891 


RESEARCH | REPORTS 


-o- 


Fig. 2. SARS-CoV-2 Omicron 
sublineages evade human 
plasma neutralizing antibodies 
elicited by infection or primary 
vaccine series. Plasma 
neutralizing antibody titers 
elicited by primary COVID-19 
vaccination series determined 
using SARS-CoV-2 S VSV 
pseudotypes using VeroE6/ 
TMPRSS2 as target cells. One- 
time (1x) infected samples (n = 
24) were obtained 26 to 78 days 
(mean, 41) after symptom onset, 
two-dose (2x) mRNA-1273 
samples (n = 14) were obtained 


Neutralization (1/ID59) 


—@- 2xmRNA-1273 


1x Infected —@- 2x BNT162b2 


—@- 2x NVX-CoV2373 


>18x 
314 


—o- 
—@- 2x AZD1222 


1x Ad26.COV2.S =@= 2x Sputnik V 


=O- 2x BBIBP-CorV 


6 to 50 days (mean, 13) after 
second dose, 2x BNT162b2 
samples (n = 14) were obtained 
6 to 33 days (mean, 14) after 
second dose, 2x NVX-CoV2373 


samples (n = 10) were obtained 17 to 


168 days (mean, 82) after second dose, one-dose (1x) Ad26.COV2.S samples 
(n = 10) were obtained 9 to 142 days (mean, 79) after first dose, 2x AZD1222 
samples (n = 16) were obtained ~30 days after second dose, 2x Sputnik V 
samples (n = 12) were obtained 60 to 90 days after second dose, and BBIBP- 
CorV samples (n = 12) were obtained 9 to 104 days (mean, 69) after second 
dose. Individual points are representative geometric mean titers from 


has been shown to increase the breadth and 
potency of neutralizing antibodies (5, 12, 17, 49). 
We thus assessed and compared the benefits 
provided by homologous or heterologous vac- 
cine boosters on vaccinee plasma neutralizing 
activity against Wuhan-Hu-1/G614, BA.1, BA.2, 
BA.2.12.1, and BA.4/5 S VSV pseudotypes. Plas- 
ma samples of subjects that received three 
mRNA vaccine doses had neutralizing GMTs 
of 2371, 406, 448, 472, and 392 against Wuhan- 
Hu-1/G614, BA.1, BA.2, BA.2.12.1, and BA.4/5 S 
VSV, respectively (Fig. 3 and fig. S8A). The five- 
to sixfold potency losses against these variants 
are marked improvements over the >13- to 
>23-fold reductions observed after two vaccine 
doses, underscoring an increase in overall neu- 
tralizing antibody potency and breadth (5, 12). 
Individuals vaccinated with two doses of NVX- 
CoV2373 followed by a booster of mRNA-1273 
(1 of 5 individuals) or NVX-CoV2373 (4 of 
5 individuals) had neutralizing GMTs of 6978 
for Wuhan-Hu-1/G614, 505 for BA.1 (14-fold 
reduction), 948 for BA.2 (sevenfold reduction), 
935 for BA.2.12.1 (sevenfold reduction), and 
330 for BA.4/5 (21-fold reduction) (Fig. 3 and 
fig. SSB). Plasma from individuals who received 
one dose of Ad26.COV2.S followed by either a 
homologous Ad26.COV2.S (12 of 14 individuals) 
or a heterologous BNT162b2 booster (2 of 14 in- 
dividuals) ~4 months later had neutralizing 
GMTs of 363, 23, 50, 46, and 29 against Wuhan- 
Hu-1/G614, BA.J, BA.2, BA.2.12.1, and BA.4/5 S 
VSV, respectively, corresponding to damp- 
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each group. Statistical sig 


demographics are shown 
fig. S5. G614 indicates Wu 


ening ranging between =7- and =16-fold with 
9 of 14 individuals maintaining neutralizing 
activity against all sublineages (Fig. 3 and 
fig. S8C). We also investigated individuals that 
received two doses of AZD1222 4 weeks apart 
followed by a BNT162b2 (17 of 18 individuals) 
or mRNA-1273 (1 of 18 individuals) booster 
~6 months later. This cohort had respective 
neutralizing GMTs of 2167, 186, 269, 273, and 
135 against Wuhan-Hu-1/G614, BA.1, BA.2, 
BA.2.12.1, and BA.4/5 S VSV, corresponding to 
8- to 16-fold potency reductions (Fig. 3 and fig. 
S8D). Individuals vaccinated with two doses 
of Sputnik V and boosted with AZD1222 (11 of 
12 individuals) or BNT162b2 (1 of 12 individ- 
uals) ~9 months later had neutralizing GMTs 
of 351, 68, 77, 72, and 35 for Wuhan-Hu-1/ 
G614, BA.1, BA.2, BA.2.12.1, and BA.4/5, respec- 
tively, amounting to 5- to 10-fold reductions 
of potency (Fig. 3 and fig. SSE). BBIBP-CorV 
vaccinees boosted with either BNT162b2 (14 of 
18 individuals) or mRNA-1273 (4 of 18 indi- 
viduals) had GMTs of 2047 for G614, 439 for 
BA.1 (fivefold reduction), 375 for BA.2 (five- 
fold reduction), 430 for BA.2.12.1 (fivefold 
reduction), and 252 for BA.4/5 (eightfold re- 
duction) (Fig. 3 and fig. SSF). The marked 
improvement in plasma neutralizing activity 
for subjects that received a booster dose over 
those that did not highlights the importance 
of vaccine boosters for eliciting potent neu- 
tralizing antibody responses against Omicron 
sublineages. 


BA.2.12.1 
BA.2.12.1 


two independent experiments consisting of two replicates each. Bars represent 
geometric means, and error bars represent geometric standard deviations for 


nificance between groups of paired data was 


determined by Wilcoxon rank sum test. ***P < 0.001; ****P < 0.0001. Patient 


n table S4. Normalized curves and fits are shown in 
han-Hu-1/G614. IDso, median inhibitory dose. 


To assess the effect of target cell lines on 
apparent Omicron immune escape, we com- 
pared the aforementioned VeroE6/TMPRSS2 
cells (43) with a stable ACE2-overexpressing 
HEK298T cell line (HEK293T/ACE2) (50) to 
determine plasma neutralizing activity for a 
cohort of mRNA-vaccinated individuals. After 
primary vaccine series, only three subjects had 
detectable neutralizing activity against any 
of the tested Omicron sublineage VSV pseudo- 
types when using HEK293T/ACE2 target cells. 
By contrast, all but one subject had detect- 
able, albeit very weak, neutralizing activity 
against Omicron VSV pseudotypes using VeroE6/ 
TMPRSS2 target cells, resulting in >17-fold, 
>14-fold, >20-fold, and >22-fold reductions 
against BA.1, BA.2, BA.2.12.1, and BA.4/5, re- 
spectively (figs. S9A and S10A). After a booster 
dose, we observed respective 7-fold, 7-fold, 
11-fold, and 13-fold reductions of neutralizing 
activity against BA.1, BA.2, BA.2.12.1, and BA.4/ 
5 VSV pseudotypes using HEK293T/ACE2 tar- 
get cells, as compared with respective seven- 
fold, sixfold, fivefold, and eightfold reductions 
when using VeroE6/TMPRSS2 target cells (figs. 
S9B and S10B). This indicates that the target 
cell lines used in neutralization assays may 
affect the observed plasma neutralizing escape 
of SARS-CoV-2 variants, which may be further 
compounded on the basis of preferential entry 
routes (15, 23, 51). 

We report that the BA.1, BA.2, BA.2.12.1, and 
BA.4/5 Omicron sublineages, which account 
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Fig. 3. Administration of a booster dose rescues neutralization potency against Omicron sublineages 
for all vaccines. Plasma neutralizing antibody titers elicited by COVID-19 vaccine boosters determined using 
SARS-CoV-2 S VSV pseudotypes and VeroE6/TMPRSS2 as target cells. Three-dose (3x) mRNA-1273 or 
BNT162b2 samples (n = 13) were donated 13 to 97 days (mean, 30) after third dose; 2x NVX-CoV2373 
plus lx BNT162b2 or NVX-CoV2373 samples (n = 5) were donated 14 to 20 days (mean, 15) after 

third dose; 1x Ad26.COV2.S plus 1x Ad26.COV2.S or BNT162b2 samples (n = 14) were donated 12 to 16 days 
(mean, 14) after second dose; 2x AZD1222 plus 1x BNT162b2 or mRNA-1273 samples (n = 18) were 
donated 30 to 123 days (mean, 87) after third dose; 2x Sputnik V plus 1x AZD1222 or BNT162b2 samples 
(n = 14) were donated 45 to 60 days after third dose; and 2x BBIBP-CorV plus 1x BNT162b2 or 
mRNA-1273 samples (n = 18) were donated 29 to 89 days (mean, 50) after third dose. Individual points 
are representative geometric mean titers from two to four independent experiments consisting of 

two replicates each. Bars represent geometric means, and error bars represent geometric standard 
deviations for each group. Statistical significance between groups of paired data was determined by 
Wilcoxon rank sum test. **P < 0.01; ***P < 0.001; ****P < 0.0001. Patient demographics are shown in 
table S4. Normalized curves and fits are shown in fig. S8. 


for >99% of all infections worldwide over the 
first half of 2022, have increased ACE2 binding 
affinity, have decreased fusogenicity, and mark- 
edly evade neutralizing antibody responses 
relative to the Wuhan-Hu-1 and Delta strains 
(3). Collectively, these data suggest that en- 
hanced receptor engagement and immune 
evasion are key changes that may have pro- 
moted the rapid spread of these Omicron sub- 
lineages and could contribute to the current 
rise in prevalence of BA.4 and BA.5. 

The development of life-saving vaccines is 
regarded as one of humanity’s greatest medi- 
cal and scientific achievements, which is ex- 
emplified by COVID-19 vaccines (52-54). Primary 
COVID-19 vaccine regimens or infection- 
elicited plasma neutralizing activity was se- 
verely dampened by Omicron sublineages BA.1, 
BA.2, BA.2.12.1, and BA.4/5. However, admin- 
istration of a booster dose increased neutral- 
izing antibody titers and breadth against all 
Omicron sublineages to appreciable levels re- 
gardless of the vaccine evaluated, concurring 
with findings for BA.1 (5, 12, 15, 17, 49, 55, 56). 
These results are consistent with previous 
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studies demonstrating that a third vaccine 
dose results in the recall and expansion of 
preexisting SARS-CoV-2 S-specific memory 
B cells, as well as de novo induction, leading 
to production of neutralizing antibodies with 
enhanced potency and breadth against var- 
iants (57, 58). Vaccinees receiving two doses 
of Ad26.COV2.S (4 months apart) had lesser 
Omicron immune escape than that in other 
two-dose vaccine recipients (3 to 4 weeks be- 
tween doses) but greater than that observed 
in three-dose vaccinees. These findings sug- 
gest that the time interval between immuniza- 
tions may affect the breadth and potency of 
vaccine-elicited plasma neutralizing activity 
and that a third dose may be beneficial for this 
cohort as well (59-62). Moreover, the induc- 
tion by several currently available vaccines of 
robust cross-reactive cellular immunity against 
SARS-CoV-2 Omicron is likely playing a key 
role in the retained protection observed against 
severe disease (63, 64). 

As SARS-CoV-2 progressively becomes en- 
demic in the human population, vaccination 
strategies will need to be carefully considered 


and optimized to provide long-lasting immu- 
nity. So far, elicitation of high titers of variant- 
neutralizing antibodies and protection against 
severe disease can be accomplished by dosing 
with the Wuhan-Hu-1 S antigen, as shown in 
animal models and studies of vaccine efficacy 
in humans (5, 48, 65, 66). In fact, an Omicron 
BA.1 (or other variant) S boost does not offer 
mice or nonhuman primates significantly more 
BA.1 protection than a Wuhan-Hu-1 S boost 
(67-71), and Omicron primary infections elicit 
neutralizing antibody and memory responses 
of narrow breadth (72-74). However, continued 
SARS-CoV-2 evolution will accentuate the anti- 
genic divergence from the ancestral strain, 
and it is unknown whether vaccines based 
on Wuhan-Hu-1 S alone will provide satisfac- 
tory protection, either as boosters in vacci- 
nated or infected individuals or as an initial 
vaccine in naive individuals (mainly children). 
The recent evaluation of intranasal vaccine 
administration could also be important to 
not only prevent severe disease but also curtail 
viral infection and transmission through in- 
duction of mucosal immunity (75-78). For these 
reasons, it is important to monitor new var- 
iants, assess the effectiveness of currently 
available vaccines, and continue to test and 
implement new vaccination strategies that 
may provide stronger, longer-lasting, or broader 
protection against SARS-CoV-2 and the entire 
sarbecovirus subgenus (40, 79, 80). 
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up programme, the Excellence Strategy. 


The Zukunftskolleg is an Institute for Advanced Study for early career researchers in 


The collaborative and interactive Arizona postdoctoral fellowship program he natural sciences, humanities and social sciences. It is an central component of the 


in Neurobiology of Aging, Alzheimer’s Disease and Related Disorders that 
includes University of Arizona, Arizona State University, Barrow Neurological 
Institute, Translational Genomics Research Institute, Banner Alzheimer’s 
Institute, Banner Sun Health Research Institute, and Mayo Clinic components 
of the statewide Arizona Alzheimer’s Consortium is seeking highly qualified 
applicants for positions supported by a National Institutes of Aging 
training grant. 


Training programs are intended to advance researcher development and 
studies in Alzheimer’s disease, related disorders and non-pathological brain 
aging. Fellows can capitalize on programs in basic, cognitive, behavioral, 
clinical, neuropathological, brain imaging and omics research. Mentors have 
expertise in molecular biology, experimental, and neuropathological research, 
the study of transgenic and wild type mice, non-human primates, brain imaging, 
genomics, the computational and statistical analysis of complex data sets, 


experimental therapeutics, the early detection, diagnosis, treatment and 
prevention of Alzheimer’s disease and related diseases, the development and 
application of new research methods, and other potential areas of interest. 
Trainees will benefit from a Professional Development Committee to foster 
career development, provide opportunities to work and learn across institutions 
all in an incredible place to live. 


This is a U.S. Government funded training grant and is only available to U.S. 
citizens or Permanent Residents. All training program institutions value an 
inclusive academic community, which advances innovation and critical thinking. 
The program encourages applications from diverse candidates including people 
of color, women, LGBTQIA2S+, veterans, and individuals with disabilities. 


Please send a CV, contact information for two references that have agreed to be 
contacted, anda letter describing your research interests to the program Admissions 
Director Dr. Elliott Mufson, Professor of Translational Neurosciences at the 
Barrow Neurological Institute c/o Katelyn.Kildoo@DignityHealth.org. 


university's strategy ,creative.together®. The Zukunftskolleg promotes interdisciplinary 
discourse between distinguished researchers from Germany and abroad, and provides 
hem with the resources to conduct their research independently and free from ad- 
ministrative constraints. The working language is English. 


The Zukunftskolleg of the University of Konstanz offers: 


One 5-year Research Fellowship 


(Salary Scale TV-L 14) 

to develop and implement individual research projects and build up their own 
research group. 

Fellowships will begin on June 1, 2023. 

Reference number 2022/199 


One 2-year Postdoctoral Fellowship 


(Salary Scale TV-L 13) 


to develop and implement individual research projects. Conditionally on the submission 
of an external grant, the position can be extended for an additional year. 

Fellowships will begin on May 1, 2023. 

Reference number 2022/198 


Applications, supporting materials and two letters of reference should 
be submitted in English by 31 October 2022 11:00 am (Konstanz 
time) using the Online Application Platform 
https://www.uni-konstanz.de/zukunftskolleg/online-application/. 


Details concerning the application, required documents, and informa- 
Zubenftwholleg tion about the Zukunftskolleg are available on our website: 
https://www.uni-konstanz.de/zukunftskolleg/ . 
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WORKING LIFE 


By Kaitlin Rasmussen 


898 


My life as a photon 


rowing up, I asked a lot of questions. Many of them foretold my future in astronomy: Why is 
the Sun yellow? Why do the constellations look like that? Why does Jupiter have a spot? My 
parents answered what they could, and bought me books to answer the rest. But my most fre- 
quent question, starting when I was about 5 years old, was why am IJ a girl? And for that, my 
parents had no answer. In fact, in the 1990s, in the foothills of the Appalachians, no one did. 
It was my first encounter with a question that has no simple answer. 


It faded in and out of my conscious- 
ness until middle school, when pu- 
berty brought feelings of dismay and 
disgust. Every day meant perform- 
ing femininity while feeling increas- 
ingly isolated from it. 

On the academic front, things 
weren’t much better. In high school, 
I particularly hated freshman phys- 
ics and its inflexible rules, which 
seemed to mirror the society I lived 
in. The same way we learned to ex- 
pect a pendulum to swing, a block 
to slide down a slope, a ball to fall 
off a cliff, I felt I was expected to go 
to church, meet a man, marry young, 
and have kids. This seemingly 
inescapable trajectory left me de- 
pressed and numb to the world. 

I went to college because I wanted 
out. I felt purposeless, hollow, 
void of meaning—but at the same 
time, standing at the crossroads to 
1000 different futures. Like a particle atop a_ perfect 
sphere, I could have fallen in any direction. It was by pure 
chance that I wandered into a bookstore and saw Stephen 
Hawking’s The Grand Design on the front table. I cannot tell 
you what inspired me to pick up a book on cosmology. But I 
did, and in a few short minutes I had discovered a doorway 
into a new kind of physics—the kind of physics that doesn’t 
have all the answers, the kind of physics that disagrees with 
itself, the kind of physics that is messy and chaotic and, God 
forbid, fun. I changed my major to astrophysics the next week. 

Over the following years, I learned about relativity, and 
how in the right circumstances time itself can slow. I learned 
about quantum mechanics, where anything can happen. 
Rules were no longer absolute. Things I had accepted as fact 
were really just approximations of unknowable truths. 

In college, I would also hear the word “transgender” for the 
first time. I would meet queer folks in loving relationships. It 
was drastically different from my first brush with queerness— 
an encounter with a slur on a sign wielded by members of 
the Westboro Baptist Church, who came to my hometown to 


“Being nonbinary means 
challenging the 
status quo every day.” 


demonstrate when I was 13. At the 
end of college, I would realize that I 
myself am bisexual—attracted to my 
own gender as well as others, just as 
gravity draws every single thing in 
the universe to every other thing. It 
felt natural, like I had found a lower 
energy state of existence. Yet I still 
wasn’t in my ground state. 

That finally happened halfway 
through graduate school, when I 
found the label “nonbinary” through 
friends on Twitter. With its fluidity 
and disavowal of the traditional two- 
gender system, nonbinary felt right. 
It felt like home. It felt like I had 
spent my whole life trying to solve a 
chaotic system only to realize there 
wasn’t one answer, but many. It was 
then I realized that I am a photon— 
possessing qualities inherent to ei- 
ther side of the binary, but ultimately 
belonging to neither. 

It hasn’t always been easy. I lived a bit of a double life for a 
while, authentic in my internet presence and closeted while I 
finished my Ph.D. But when I was ready, embracing my iden- 
tity brought me into an incredible community. 

Now, I realize the power of my identity. Being nonbinary 
means challenging the status quo every day. It means ev- 
erything can and must be questioned. It means exploring 
things others take to be fundamental in new ways from new 
angles. In my everyday life, my gender identity compels 
me to find unconventional solutions to difficult problems. 
I turn over unseen stones. I try unorthodox methods. I 
wrestle with big, fundamental questions. All of these things 
make me a better scientist. 

Physics is always evolving, and gender is, too. When we 
understand that things are more complex than they appear, 
we learn. When scientists embrace the complexity of the uni- 
verse, our science can only improve. 


Kaitlin Rasmussen is a postdoc at the University of Washington, Seattle, 
and amember of the International Society of Nonbinary Scientists. 
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Your Legacy to Science 


AN ESTATE GIFT TO THE 
AMERICAN ASSOCIATION FOR THE ADVANCEMENT OF SCIENCE 


Since 1848, our founding year, the American Association for the 
Advancement of Science (AAAS) has been deeply committed to 
advancing science, engineering and innovation around the world 


for the benefit of all people. 


By making AAAS a beneficiary of your will, trust, retirement plan or 
life insurance policy, you become a member of our 1848 Society, 
joining Thomas Edison, Alexander Graham Bell and the many 
distinguished individuals whose vision led to the creation of AAAS 


and our world-renowned journal, Science, so many years ago. 


Unlike many of its peers, Science is not for-profit. Your estate gift 
would provide long-term financial stability and durable annual 
income that will support operations and competitive innovation 


for years to come. This support is vital. 


Yes, | would like more information about joining the AAAS 1848 Society. 


PLEASE CONTACT ME AT: 
Name: 

Address: 

City: State: 
Email: 


RETURN THIS FORM TO: 


“As a teacher and instructor, | bear responsibility for the 
younger generations. If you have extra resources, concentrate 


them on organizations, like AAAS, that are doing work for all.” 


—Prof. Elisabeth Ervin-Blankenheim, 1848 Society member 


If you intend to include AAAS in your estate plans, provide this 


information to your lawyer or financial adviser: 


Legal Name: American Association for the Advancement of Science 
Federal Tax ID Number: 53-0196568 
Address: 1200 New York Avenue, NW, Washington, DC 20005 


If you would like more information on making an estate gift to 


AAAS, cut out and return the form below or send an 


1348 


SOCIET ¥ 


email to philanthropy@aaas.org. Additional 
details are also available online at 


www.aaas.org/1848Society. 


MAVAAAS 


Zip code: Country: 


Phone: 


AAAS Office of Philanthropy and Strategic Partnerships * 1200 New York Avenue, NW + Washington, DC 20005 USA 


It’s time to think 


Find out how NEB can support your infectious disease research and development. 


Gaining a better understanding of infectious diseases, / 
including their characterization, evolution and transmission, 

continues to be a priority, both from an R&D standpoint 

and as a public health issue. The COVID-19 pandemic has 
demonstrated the need for a wide range of tools to research | 
infectious diseases, and has highlighted the importance 

of speed and the ability to pivot as new problems arise. 

This has emphasized the need for innovation and thinking 

differently about where to access those critical materials, 

including genomics reagents. 


Many scientists know NEB as a trusted reagent supplier to 
the life science community, but what you may not know is 
that we also offer a portfolio of products that can be used 
in infectious disease research, development of diagnostics 
and therapies, and in epidemiological studies and disease 
surveillance. In fact, many of our products have supported 
the development of COVID-19 diagnostics and vaccines, 
and can also be utilized with other infectious diseases, such 


as influenza and malaria. 


“GMP-grade” is a branding term NEB uses to describe reagents manufactured at our Rowley, MA facility, where we utilize procedures and process controls to 
manufacture reagents in compliance with ISO 9001 and ISO 13485 quality management system standards. NEB does not manufacture or sell products known 
as Active Pharmaceutical Ingredients (APIs), nor do we manufacture products in compliance with all of the Current Good Manufacturing Practice regulations. 


One or more of these products are covered by patents, trademarks and/or copyrights owned or controlled by New England Biolabs, Inc. For more information, 


please email us at busdev@neb.com. The use of these products may require you to obtain additional third party intellectual property rights for certain applications. 


© Copyright 2022, New England Biolabs, Inc,; all rights reserved. 


differently. 


Benefit from almost 50 years of experience in 
molecular biology & enzymology 


Partner with our OEM & Customized Solutions 
team to find the best solution for your needs 


Take advantage of our expanded manufacturing 
capabilities 


Access product formats, such as GMP-grade’, 
lyophilized, lyo-ready and glycerol-free 


Be confident in your product performance with 
our expanded quality and regulatory systems 


Ready to get started? Learn more at 
www.neb.com/InfectiousDiseases 


NEW ENGLAND 


ioLabs:. 


be INSPIRED 
drive DISCOVERY 
stay GENUINE 


